Preface


This book is an introduction to aspects of discrete mathematics. It is intended as a textbook which could be used
at undergraduate level, probably in the second year of an English undergraduate
mathematics course. Some textbooks on discrete mathematics are written
primarily for computing science students, but the present book is intended for 
students following a mathematics course. The place of discrete mathematics in 
the undergraduate curriculum is now fairly well established, and it is certain 
that its place in the curriculum will be maintained in the third millennium. 

Discrete mathematics has several aspects. One fundamental part is 
enumeration, the study of counting arrangements of various types. We might 
count the number of ways of choosing six lottery numbers from 1,2,... , 49, or 
the number of spanning trees in a complete graph, or the number of ways of 
arranging 16 teams into four groups of four. We develop methods of counting 
which can deal with such problems. 

Next, graph theory can be used to model a variety of situations — road 
systems, chemical molecules, timetables for examinations. We introduce the 
basic types of graph and give some indication of what the important properties
are that a graph might possess. 

The third area of discrete mathematics to be discussed in this book is that of 
configurations or arrangements. Latin squares are arrangements of symbols 
in a particular way; such arrangements can be used to construct experimental 
designs, magic squares and tournament designs. This leads us on to have a look 
at block designs, which were discussed extensively by statisticians as well as 
mathematicians due to their usefulness in the design of experiments. The book 
closes with a brief introduction to the ideas behind error-correcting codes. 

The reader does not require a great deal of technical knowledge to be able 
to cope with the contents of the book. A knowledge of the method of proof by 
induction, an acquaintance with the elements of matrix theory and of arithmetic 
modulo n, a familiarity with geometric series and a certain clarity of thought 
should see the reader through. Often the main problem encountered by the
reader is not in the depth of the argument, but in looking at the problem in 
the “right way”. Facility in this comes of course with practice. 

Each chapter ends with a good number of examples. Hints and solutions to 
most of these are given at the end of the book. The examples are a mixture of 
fairly straightforward applications of the ideas of the chapter and more
challenging problems which are of interest in themselves or are of use later on in
the book. 

My hope is that this text will provide the basis for a first course in discrete
mathematics. Obviously the choice of material for such a course is dependent
on the interests of the teacher, but there should be enough topics here to enable 
an appropriate choice to be made. The text has been influenced in countless 
ways by the many texts that have appeared over the years, but ultimately 
it is determined by my own preferences, likes and dislikes, and by my own 
experience of teaching discrete mathematics at different levels over many years, 
from masterclasses for 14-year-olds to final year honours courses. 

I would like to thank the Springer staff for their encouragement to write this 
book and for their help in its production. Thanks are also due to Gail Henry for
converting my manuscript into a LaTeX file, and to Mark Thomson for reading
and commenting on many of the chapters. 


University of Glasgow, June 2000 
Counting and Binomial Coefficients


In this chapter we introduce the basic counting methods, the factorial function
and the binomial coefficients. These are of fundamental importance to the
subject matter of subsequent chapters. We start with two basic principles.


1.1 Basic Principles 


(a) The multiplication principle. Suppose that an activity consists of $k$
stages, and that the $i$th stage can be carried out in $a_i$ different ways, irrespective
of how the other stages are carried out. Then the whole activity can be carried
out in $a_1 a_2 \cdots a_k$ ways.


Example 1.1 


A restaurant serves three types of starter, six main courses and five desserts. 
So a three-course meal can be chosen in 3 x 6 x 5 = 90 ways. 
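
The count in Example 1.1 can be checked by listing every meal. The following is a minimal sketch; the menu labels are hypothetical placeholders, and only the counts 3, 6 and 5 come from the example.

```python
from itertools import product

# Hypothetical menu labels; only the counts (3, 6, 5) come from Example 1.1.
starters = [f"starter {i}" for i in range(1, 4)]
mains = [f"main {i}" for i in range(1, 7)]
desserts = [f"dessert {i}" for i in range(1, 6)]

# A meal is one choice at each stage, so the meals form a Cartesian product.
three_course_meals = list(product(starters, mains, desserts))

print(len(three_course_meals))  # 3 * 6 * 5 = 90
```

Listing the meals explicitly is only sensible for small menus; the multiplication principle gives the count without any enumeration.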


(b) The addition principle. If $A_1, \ldots, A_k$ are pairwise disjoint sets (i.e.
$A_i \cap A_j = \emptyset$ whenever $i \neq j$), then the number of elements in their union is

\[ |A_1 \cup \cdots \cup A_k| = |A_1| + \cdots + |A_k| = \sum_{i=1}^{k} |A_i|. \]


Example 1.2 


In the above example, how many different two-course meals (including a main 
course) are there? 
Solution


There are two types of two-course meal to consider. Let $A_1$ denote the set of
meals consisting of a starter and a main course, and let $A_2$ denote the set of
meals consisting of a main course and a dessert. Then the required number is

\[
\begin{aligned}
|A_1 \cup A_2| &= |A_1| + |A_2| && \text{(by the addition principle)} \\
&= (3 \times 6) + (6 \times 5) && \text{(by the multiplication principle)} \\
&= 48.
\end{aligned}
\]


1.2 Factorials


How many ways are there of placing a,b and c in a row? There are six ways, 
namely 
abc, acb, bac, bca, cab, cba.


Note that there are three choices for the first place, then two for the second, 
and then just one for the third; so by the multiplication principle there are 
$3 \times 2 \times 1 = 6$ possible orderings. In general, if we define $n!$ (“n factorial”) by


\[ n! = n(n-1)(n-2) \cdots 2 \cdot 1 \]


then we have 


Theorem 1.1 


The number of ways of placing $n$ objects in order is $n!$.


Example 1.3


Four people, A, B, C, D, form a committee. One is to be president, one secretary,
one treasurer, and one social convener. In how many ways can the posts be
assigned?


Solution


Think of first choosing a president, then a secretary, and so on. There are $4! = 24$
possible choices.


The value of n! gets big very quickly: 
\[ 5! = 120, \quad 10! = 3\,628\,800, \quad 50! \approx 3.04 \times 10^{64}. \]
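
These values are easy to reproduce. The sketch below uses Python's standard `math.factorial`; the choice of which values of n to print is ours.

```python
from math import factorial

for n in (5, 10, 50):
    value = factorial(n)
    # The 'e' format code shows the size of the number in scientific notation.
    print(f"{n}! = {value} (about {value:.2e})")
```

Python integers have arbitrary precision, so 50! is computed exactly before being rounded for display.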


This is an example of what is called combinatorial explosion: the number of 
different arrangements of n objects gets huge as n increases. The enormous size 
of n! lies behind what is known as the travelling salesman problem which will 
be studied further in Chapter 4. A traveller sets out from home, has to visit n
towns and then return home. Given the mileages between the towns, how does
the traveller find the shortest possible route? The naive approach of considering
each of the n! possible routes is impracticable if n is large, so another approach
is needed.

In certain problems, only some of a given set of objects are to be listed.


Example 1.4 


A competition on the back of a cereal packet lists ten properties of a car, and 
asks the consumer to choose the six most important ones, listing them in order 
of importance. How many different entries are possible? 


Solution 

There are ten possibilities for the first choice, then nine for the second, and so
on down to five for the sixth. So, by the multiplication principle, the number
of possible lists is

\[ 10 \times 9 \times 8 \times 7 \times 6 \times 5 = \frac{10!}{4!} = 151\,200. \]


In general, we have: 


Theorem 1.2 
The number of ways of selecting $r$ objects from $n$, in order but with no repetitions,
is

\[ n(n-1) \cdots (n-r+1) = \frac{n!}{(n-r)!}. \]


Example 1.5 


A committee of 4 is to be chosen, as in Example 1.3, but this time there are 20 
people to choose from. The number of choices of president, secretary, treasurer, 
social convener is 


\[ 20 \times 19 \times 18 \times 17 = \frac{20!}{16!} = 116\,280. \]
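
The formula of Theorem 1.2 can be checked against the two worked examples. This is a small sketch; the function name `ordered_selections` is our own, and `math.perm` (Python 3.8 or later) is used only as an independent check.

```python
from itertools import permutations
from math import factorial, perm

def ordered_selections(n: int, r: int) -> int:
    """Ordered selections of r from n without repetition: n!/(n-r)!."""
    return factorial(n) // factorial(n - r)

print(ordered_selections(10, 6))  # Example 1.4
print(ordered_selections(20, 4))  # Example 1.5

# For a small case the selections can be listed explicitly.
small_case = list(permutations("abcde", 3))
print(len(small_case), ordered_selections(5, 3))
```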


1.3 Selections 


Suppose that in Example 1.5 we wanted just the number of ways of choosing 
four people for a committee, not bothering about the positions they might fill. 
Denote by $\binom{20}{4}$ (and read as 20-choose-4) the number of ways in which we can
choose 4 from 20 where order does not matter.
Each such choice of four from 20 can be ordered in 4! ways to give an assignment
to particular positions within the committee, so, by Example 1.5,

\[ 4! \times \binom{20}{4} = \frac{20!}{16!}. \]

Thus

\[ \binom{20}{4} = \frac{20!}{4!\,16!} = 4845. \]

This argument is general: $\frac{n!}{(n-r)!} = r! \times \binom{n}{r}$, so we have the following general
formula.


Theorem 1.3 


Let $\binom{n}{r}$ denote the number of unordered selections of $r$ from $n$ where repetitions
are not allowed. Then

\[ \binom{n}{r} = \frac{n!}{r!\,(n-r)!}. \tag{1.1} \]


Since some students find the idea of turning ordered selections into unordered 
selections confusing, here is another way of deriving the formula (1.1). 

Suppose we have to choose a team of $r$ players from a pool of $n$, one of them
to be appointed captain. This can be done by first choosing the team (there are
$\binom{n}{r}$ ways of doing this) and then choosing the captain (there are $r$
ways of doing this). So there are $r\binom{n}{r}$ choices altogether. But we could, instead,
first choose the captain (there are $n$ ways of doing this) and then choose the
$r - 1$ other members of the team (there are $\binom{n-1}{r-1}$ ways of doing this). So
the number of choices is also $n\binom{n-1}{r-1}$. Thus

\[ r\binom{n}{r} = n\binom{n-1}{r-1} \]

so that

\[ \binom{n}{r} = \frac{n}{r}\binom{n-1}{r-1}. \tag{1.2} \]

But similarly, $\binom{n-1}{r-1} = \frac{n-1}{r-1}\binom{n-2}{r-2}$, on replacing $n$ by $n-1$ and $r$ by $r-1$, so we
get

\[ \binom{n}{r} = \frac{n}{r} \cdot \frac{n-1}{r-1}\binom{n-2}{r-2} = \cdots = \frac{n}{r} \cdot \frac{n-1}{r-1} \cdots \frac{n-r+2}{2}\binom{n-r+1}{1}. \]

Since $\binom{m}{1}$ is clearly always $m$, we obtain finally

\[ \binom{n}{r} = \frac{n(n-1) \cdots (n-r+1)}{r!} = \frac{n!}{r!\,(n-r)!} \]

as before.
[Note in passing that this is a good example of the technique of counting
the same thing in two different ways.]


Example 1.6 
In the UK National Lottery, a participant chooses six of the numbers 1 to 49;
order does not matter. So the number of possible choices is

\[ \binom{49}{6} = \frac{49 \times 48 \times 47 \times 46 \times 45 \times 44}{6!} = 13\,983\,816. \]


So there is roughly one chance in 14 million of winning the jackpot! 


Example 1.7 
How likely is it that next week’s lottery winning numbers will be disjoint from
this week’s?

Solution


There are $\binom{49}{6}$ possible selections next week. The number of these which are
disjoint from this week’s must be $\binom{43}{6}$, since six of the 49 numbers are ruled
out. Since all $\binom{49}{6}$ selections are equally likely, the required probability is

\[ \binom{43}{6} \Big/ \binom{49}{6} = 0.436\ldots. \]
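
Both lottery calculations (Examples 1.6 and 1.7) can be reproduced with `math.comb` from the Python standard library. The variable names below are ours.

```python
from math import comb

total = comb(49, 6)      # all possible selections of six numbers
disjoint = comb(43, 6)   # selections avoiding this week's six numbers
probability = disjoint / total

print(total)
print(disjoint)
print(round(probability, 3))
```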


Example 1.8 


Binary sequences. There are $2^n$ $n$-digit binary sequences since each of the $n$
digits is 0 or 1. For example, the eight binary sequences of length three are


000 001 010 011 100 101 110 111.


(a) How many binary sequences of length 12 contain exactly six 0s? 


(b) How many have more 0s than 1s? 


Solution 


(a) The six 0s occupy six of the 12 positions. There are $\binom{12}{6} = 924$ choices of
these six positions, and this gives the number required.


(b) There are $2^{12} - 924 = 3172$ sequences with unequal numbers of 0s and 1s.
By symmetry, exactly half of these, i.e. 1586, will have more 0s than 1s.
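
Since there are only 4096 binary sequences of length 12, both answers in Example 1.8 can be confirmed by brute force. This is a sketch for checking purposes only; the variable names are ours.

```python
from itertools import product

# Every binary sequence of length 12, as a tuple of the characters '0' and '1'.
sequences = list(product("01", repeat=12))

exactly_six_zeros = sum(1 for s in sequences if s.count("0") == 6)
more_zeros_than_ones = sum(1 for s in sequences if s.count("0") > s.count("1"))

print(len(sequences), exactly_six_zeros, more_zeros_than_ones)
```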
We close this section with two simple properties of the numbers $\binom{n}{r}$ which arise
out of the fact that $\binom{n}{r}$ is the number of ways of choosing $r$ from $n$. From now
on, we follow the conventions that $0! = 1$ and that $\binom{n}{0} = 1$ for all $n \geq 0$.


Theorem 1.4

(i) $\binom{n}{r} = \binom{n}{n-r}$ for all $r$, $0 \leq r \leq n$.

(ii) $\binom{n+1}{r} = \binom{n}{r} + \binom{n}{r-1}$ for all $r$, $0 < r \leq n$.

Proof 


(i) This follows immediately from the formula (1.1); alternatively, simply observe
that choosing $r$ from $n$ is just the same as selecting the $n - r$ which
are not to be chosen!


(ii) A choice of $r$ of the $n + 1$ objects $x_1, \ldots, x_{n+1}$ may or may not include
$x_{n+1}$. If it does not, then $r$ objects have to be chosen from $x_1, \ldots, x_n$ and
there are $\binom{n}{r}$ such choices. If it does contain $x_{n+1}$, then $r - 1$ further objects
have to be chosen from $x_1, \ldots, x_n$, and there are $\binom{n}{r-1}$ such choices. The
result now follows from the addition principle.


Alternatively we can use the formula (1.1):

\[
\begin{aligned}
\binom{n}{r} + \binom{n}{r-1} &= \frac{n!}{r!\,(n-r)!} + \frac{n!}{(r-1)!\,(n-r+1)!} \\
&= \frac{n!\,(n-r+1)}{r!\,(n-r+1)!} + \frac{n!\,r}{r!\,(n-r+1)!} \\
&= \frac{n!\,(n+1)}{r!\,(n-r+1)!} = \frac{(n+1)!}{r!\,(n+1-r)!} = \binom{n+1}{r}.
\end{aligned}
\]

1.4 Binomial Coefficients and Pascal’s Triangle 


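The choice numbers $\binom{n}{r}$ are known as binomial coefficients. In this section
we find out why. Note that

\[
\begin{array}{lcll}
(1+y)^0 & = & 1 & \qquad \text{coefficients: } 1 \\
(1+y)^1 & = & 1 + y & \qquad 1 \quad 1 \\
(1+y)^2 & = & 1 + 2y + y^2 & \qquad 1 \quad 2 \quad 1 \\
(1+y)^3 & = & 1 + 3y + 3y^2 + y^3 & \qquad 1 \quad 3 \quad 3 \quad 1 \\
(1+y)^4 & = & 1 + 4y + 6y^2 + 4y^3 + y^4 & \qquad 1 \quad 4 \quad 6 \quad 4 \quad 1
\end{array}
\]

Note the triangular array of coefficients, known as Pascal’s triangle. Each
row consists of the choice numbers: e.g. the bottom row shown consists of

\[ \binom{4}{0} = 1, \quad \binom{4}{1} = 4, \quad \binom{4}{2} = 6, \quad \binom{4}{3} = 4, \quad \binom{4}{4} = 1. \]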


Pascal (1623-1662) certainly studied these numbers, making use of them in his 
work on probability, but the triangle was known to Chinese mathematicians 
long before. 


Theorem 1.5 


The binomial theorem 


\[ (x+y)^n = \binom{n}{0} x^n + \binom{n}{1} x^{n-1} y + \binom{n}{2} x^{n-2} y^2 + \cdots + \binom{n}{n} y^n = \sum_{r=0}^{n} \binom{n}{r} x^{n-r} y^r. \]


Proof


$(x+y)^n = (x+y)(x+y) \cdots (x+y)$ ($n$ brackets). So the coefficient of $x^{n-r} y^r$
in the expansion is the number of ways of getting $x^{n-r} y^r$ when the $n$ brackets
are multiplied out. Each term in the expansion is the product of one term from
each bracket; so $x^{n-r} y^r$ is obtained as many times as we can choose $y$ from $r$
of the brackets (and $x$ from the remaining $n - r$ brackets). But this is just the
number of ways of choosing $r$ of the $n$ brackets, which is $\binom{n}{r}$.


Corollary 1.6 


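\[ (1+y)^n = \sum_{r=0}^{n} \binom{n}{r} y^r. \]


Example 1.9

\[ 1 + \binom{10}{1} 2 + \binom{10}{2} 2^2 + \cdots + \binom{10}{10} 2^{10} = (1+2)^{10} = 3^{10}. \]

Pascal’s triangle


                       1                          row 0
                     1   1                        row 1
                   1   2   1                      row 2
                 1   3   3   1                    row 3
               1   4   6   4   1                  row 4
             1   5  10  10   5   1                row 5
           1   6  15  20  15   6   1              row 6
         1   7  21  35  35  21   7   1            row 7
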
In the $n$th row, $n \geq 0$, the entries are the binomial coefficients $\binom{n}{r}$, $0 \leq r \leq n$.
The triangle displays the two properties of Theorem 1.4: (i) is shown in the 
reverse symmetry of each row, and (ii) is shown by the fact that each entry in 
the triangle is the sum of the two entries immediately above it (e.g. 21 = 6+15). 
Note also the entries in each row add up to a power of 2. 
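
The triangle can be generated row by row using only the addition rule of Theorem 1.4(ii). The following is a minimal sketch; the function name `pascal_rows` is our own.

```python
def pascal_rows(n_max: int) -> list[list[int]]:
    """Return rows 0..n_max of Pascal's triangle, built by the addition rule."""
    rows = [[1]]
    for _ in range(n_max):
        prev = rows[-1]
        # Each interior entry is the sum of the two entries immediately above it.
        interior = [prev[i - 1] + prev[i] for i in range(1, len(prev))]
        rows.append([1] + interior + [1])
    return rows

rows = pascal_rows(7)
for n, row in enumerate(rows):
    # Show each row with its sum, which should be a power of 2.
    print(n, row, sum(row))
```

The printed row sums illustrate the remark that the entries in each row add up to a power of 2.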




Theorem 1.7 

(i) $\binom{n}{0} + \binom{n}{1} + \binom{n}{2} + \cdots + \binom{n}{n} = 2^n$.

(ii) $\binom{n}{0} - \binom{n}{1} + \binom{n}{2} - \cdots + (-1)^n \binom{n}{n} = 0$.

Proof 


(i) Put y = 1 in Corollary 1.6. (ii) Put y = —1. 


The next result establishes a pattern relating to the diagonals in Pascal’s 
triangle. Note, for example, that in a line parallel to the left side of the triangle, 
we have 1 + 3 + 6 + 10 + 15 = 35.


Theorem 1.8 


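For all $m \geq 0$ and $n \geq 1$,

\[ \sum_{k=0}^{n} \binom{m+k}{m} = \binom{m+n+1}{m+1}. \]


Proof

\[
\begin{aligned}
\binom{m+n+1}{m+1} &= \binom{m+n}{m} + \binom{m+n}{m+1} && \text{by Theorem 1.4(ii)} \\
&= \binom{m+n}{m} + \binom{m+n-1}{m} + \binom{m+n-1}{m+1} \\
&= \cdots \\
&= \binom{m+n}{m} + \binom{m+n-1}{m} + \cdots + \binom{m+1}{m} + \binom{m+1}{m+1} \\
&= \binom{m+n}{m} + \binom{m+n-1}{m} + \cdots + \binom{m+1}{m} + \binom{m}{m},
\end{aligned}
\]

since $\binom{m+1}{m+1} = \binom{m}{m} = 1$.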
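
Theorem 1.8 is easy to test numerically. The sketch below checks the diagonal sum quoted before the theorem and then a small range of m and n; the function name `diagonal_sum` and the ranges tested are our own choices.

```python
from math import comb

def diagonal_sum(m: int, n: int) -> int:
    """Sum of C(m+k, m) for k = 0, 1, ..., n (a diagonal of Pascal's triangle)."""
    return sum(comb(m + k, m) for k in range(n + 1))

# The example from the text: 1 + 3 + 6 + 10 + 15 = 35 (m = 2, n = 4).
print(diagonal_sum(2, 4), comb(7, 3))

# A wider numerical check of the identity.
identity_holds = all(
    diagonal_sum(m, n) == comb(m + n + 1, m + 1)
    for m in range(8)
    for n in range(1, 10)
)
print(identity_holds)
```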
Identities 


The binomial theorem can be used to obtain other identities involving binomial 
coefficients. 
Example 1.10
Consider the identity

\[ (1+x)^n (1+x)^n = (1+x)^{2n}, \]

i.e.

\[ \left[ \binom{n}{0} + \binom{n}{1} x + \cdots + \binom{n}{n} x^n \right] \left[ \binom{n}{0} + \binom{n}{1} x + \cdots + \binom{n}{n} x^n \right] = \sum_{r=0}^{2n} \binom{2n}{r} x^r. \]

Equating the coefficients of $x^n$ on each side of this identity (and using $\binom{n}{n-r} = \binom{n}{r}$) gives

\[ \binom{n}{0}^2 + \binom{n}{1}^2 + \cdots + \binom{n}{n}^2 = \binom{2n}{n}. \]

For example, with $n = 4$ we get

\[ 1^2 + 4^2 + 6^2 + 4^2 + 1^2 = 70 = \binom{8}{4}. \]


A similar type of argument enables us to obtain a corresponding result for 


alternating sums. 


Example 1.11 
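Use the identity $(1 - x^2)^n = (1 - x)^n (1 + x)^n$ to sum

\[ \binom{n}{0}^2 - \binom{n}{1}^2 + \binom{n}{2}^2 - \cdots + (-1)^n \binom{n}{n}^2. \]

Solution


Consider the coefficient of $x^n$ on both sides of the given identity.
Since the right-hand side is

\[ \left[ 1 - \binom{n}{1} x + \binom{n}{2} x^2 - \cdots + (-1)^n \binom{n}{n} x^n \right] \left[ 1 + \binom{n}{1} x + \binom{n}{2} x^2 + \cdots + \binom{n}{n} x^n \right], \]

the coefficient of $x^n$ is

\[ \binom{n}{0}\binom{n}{n} - \binom{n}{1}\binom{n}{n-1} + \cdots + (-1)^n \binom{n}{n}\binom{n}{0} = \binom{n}{0}^2 - \binom{n}{1}^2 + \cdots + (-1)^n \binom{n}{n}^2. \]

The coefficient of $x^n$ on the left of the given identity is 0 if $n$ is odd (why?) but
is $(-1)^{n/2} \binom{n}{n/2}$ if $n$ is even. So we have

\[ \binom{n}{0}^2 - \binom{n}{1}^2 + \binom{n}{2}^2 - \cdots + (-1)^n \binom{n}{n}^2 = \begin{cases} 0 & \text{if } n \text{ is odd,} \\ (-1)^{n/2} \binom{n}{n/2} & \text{if } n \text{ is even.} \end{cases} \]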


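For example,

\[ \binom{5}{0}^2 - \binom{5}{1}^2 + \binom{5}{2}^2 - \binom{5}{3}^2 + \binom{5}{4}^2 - \binom{5}{5}^2 = 0 \quad \text{(obviously!)}; \]

\[ \binom{6}{0}^2 - \binom{6}{1}^2 + \binom{6}{2}^2 - \cdots + \binom{6}{6}^2 = (-1)^3 \binom{6}{3} = -\binom{6}{3}. \]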
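
The identities of Examples 1.10 and 1.11 can be checked numerically for small n. This is only a sanity check, not a proof; the function names are our own.

```python
from math import comb

def sum_of_squares(n: int) -> int:
    """C(n,0)^2 + C(n,1)^2 + ... + C(n,n)^2."""
    return sum(comb(n, r) ** 2 for r in range(n + 1))

def alternating_sum_of_squares(n: int) -> int:
    """C(n,0)^2 - C(n,1)^2 + ... + (-1)^n C(n,n)^2."""
    return sum((-1) ** r * comb(n, r) ** 2 for r in range(n + 1))

def expected_alternating(n: int) -> int:
    """Value given in Example 1.11: 0 for odd n, (-1)^(n/2) C(n, n/2) for even n."""
    if n % 2 == 1:
        return 0
    return (-1) ** (n // 2) * comb(n, n // 2)

for n in range(1, 9):
    print(n, sum_of_squares(n), comb(2 * n, n),
          alternating_sum_of_squares(n), expected_alternating(n))
```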


1.5 Selections with Repetitions 


We have already seen that there are $2^m$ binary sequences of length $m$. Here we
are choosing m digits in order, and there are two choices (0 or 1) for each. 


Example 1.12 


The number of subsets of a set of $m$ elements is $2^m$. For each subset corresponds
to a binary sequence of length $m$, in which the $i$th digit is 1 precisely when the
$i$th element is in the subset. For example, the subset {2, 3, 5} of {1, 2, 3, 4, 5}
corresponds to 01101. Which subsets are represented by (a) 11100, (b) 00000?
[{1, 2, 3}, $\emptyset$].


Example 1.13 

Each weekend in February I can visit any one of three cinemas. How many 
different sequences of visits are possible, repeat visits of course being allowed?

Solution


Each weekend I have 3 choices, so by the multiplication principle the
number of visiting sequences is $3^4 = 81$.

Clearly we have


Theorem 1.9


The number of ways of choosing $r$ objects from $n$, in order and with repetitions
allowed, is $n^r$.


Suppose now that we have to choose r objects from n, where repetitions are 
allowed, but where order does not matter. 


Example 1.14 


There are ten ways of choosing two objects from {1,2,3,4} unordered, with 
repetitions allowed. They are: 


1,1 1,2 1,3 1,4 2,2 2,3 2,4 3,3 3,4 4,4. 


Theorem 1.10 


The number of unordered choices of $r$ from $n$, with repetitions allowed, is

\[ \binom{n+r-1}{r}. \]

Proof 


Any choice will consist of $x_1$ choices of the first object, $x_2$ choices of the second
object, and so on, subject to the condition $x_1 + \cdots + x_n = r$. So the required
number is just the number of solutions of the equation $x_1 + \cdots + x_n = r$ in
non-negative integers $x_i$.

Now we can represent a solution $x_1, \ldots, x_n$ by a binary sequence:

$x_1$ 0s, 1, $x_2$ 0s, 1, $x_3$ 0s, 1, ..., 1, $x_n$ 0s.

Think of the 1s as indicating a move from one object to the next. For example,
the solution $x_1 = 2, x_2 = 0, x_3 = 2, x_4 = 1$ of $x_1 + x_2 + x_3 + x_4 = 5$ corresponds
to the binary sequence 00110010. Corresponding to $x_1 + \cdots + x_n = r$, there
will be $n - 1$ 1s and $r$ 0s, and so each sequence will be of length $n + r - 1$,
containing exactly $r$ 0s. Conversely, any such sequence corresponds to a
non-negative integer solution of $x_1 + \cdots + x_n = r$. Now the $r$ 0s can be in any of
the $n + r - 1$ places, so the number of such sequences, and hence the number
of unordered choices, is $\binom{n+r-1}{r}$, the number of ways of choosing $r$ places out
of $n + r - 1$.
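
The correspondence used in this proof can be written out directly. The sketch below encodes a solution as a binary sequence, decodes it again, and checks the count for small cases by brute force; the function names `encode`, `decode` and `solutions` are our own.

```python
from itertools import product
from math import comb

def encode(solution: tuple[int, ...]) -> str:
    """x_1 0s, 1, x_2 0s, 1, ..., 1, x_n 0s."""
    return "1".join("0" * x for x in solution)

def decode(sequence: str) -> tuple[int, ...]:
    """Recover (x_1, ..., x_n) by measuring the blocks of 0s between the 1s."""
    return tuple(len(block) for block in sequence.split("1"))

def solutions(n: int, r: int) -> list[tuple[int, ...]]:
    """All non-negative integer solutions of x_1 + ... + x_n = r (brute force)."""
    return [xs for xs in product(range(r + 1), repeat=n) if sum(xs) == r]

print(encode((2, 0, 2, 1)))            # the example from the proof
print(decode("00110010"))
print(len(solutions(4, 5)), comb(4 + 5 - 1, 5))
print(len(solutions(4, 2)), comb(4 + 2 - 1, 2))   # Example 1.14
```

The brute-force search is exponential in n and is intended only for checking small cases.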

This proof also establishes the following result. 


Theorem 1.11 


The number of solutions of $x_1 + \cdots + x_n = r$ in non-negative integers $x_i$ is

\[ \binom{n+r-1}{r}. \]

Example 1.15


The number of solutions of $x + y + z = 17$ in non-negative integers is

\[ \binom{17+3-1}{17} = \binom{19}{17} = \binom{19}{2} = 171. \]


Example 1.16 


How many solutions are there of $x + y + z = 17$ in positive integers?


Solution


Here we require $x \geq 1$, $y \geq 1$, $z \geq 1$, so we put $x = 1 + u$, $y = 1 + v$, $z = 1 + w$.
The equation becomes $u + v + w = 14$, and we seek solutions in non-negative
integers $u, v, w$. The number of solutions is therefore

\[ \binom{14+3-1}{14} = \binom{16}{14} = \binom{16}{2} = 120. \]
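
Both counts for the equation x + y + z = 17 can be confirmed by direct enumeration. This is a small sketch; the function name `count_solutions` and its `lower` parameter are our own.

```python
from math import comb

def count_solutions(total: int, lower: int) -> int:
    """Count integer solutions of x + y + z = total with x, y, z >= lower."""
    count = 0
    for x in range(lower, total + 1):
        for y in range(lower, total + 1):
            z = total - x - y   # z is forced once x and y are chosen
            if z >= lower:
                count += 1
    return count

print(count_solutions(17, 0), comb(19, 2))   # non-negative integers (Example 1.15)
print(count_solutions(17, 1), comb(16, 2))   # positive integers (Example 1.16)
```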


Example 1.17 


How many binary sequences are there containing exactly $p$ 0s and $q$ 1s, $q \geq p - 1$,
with no two 0s together?


Solution


Imagine the $q$ 1s placed in a row: ...1...1...1.... They create $q + 1$ “boxes”
(spaces) in which to put the 0s ($q - 1$ boxes between the 1s, and one at each
end). The $p$ 0s have to be placed in different boxes, so the number of choices
is $\binom{q+1}{p}$.

Alternatively, we could first place the $p$ 0s in a row. They create $p + 1$ boxes
in which to place the $q$ 1s. But the $p - 1$ internal boxes must receive at least
one 1. If we let $x_i$ denote the number of 1s placed in the $i$th box, we want
the number of solutions of $x_1 + \cdots + x_{p+1} = q$, where $x_1 \geq 0$, $x_{p+1} \geq 0$, and
all other $x_i \geq 1$. Putting $x_1 = y_1$, $x_{p+1} = y_{p+1}$, $x_i = 1 + y_i$ otherwise, the
equation becomes $y_1 + \cdots + y_{p+1} = q - (p - 1) = q - p + 1$, and the number of
non-negative integer solutions is

\[ \binom{(p+1) + (q-p+1) - 1}{q-p+1} = \binom{q+1}{q+1-p} = \binom{q+1}{p}. \]
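
The answer to Example 1.17 can be checked by choosing the positions of the 0s and rejecting any choice with two adjacent positions. This is a brute-force sketch; the function name and the ranges of p and q tested are our own.

```python
from itertools import combinations
from math import comb

def count_no_adjacent_zeros(p: int, q: int) -> int:
    """Count binary sequences with p 0s and q 1s in which no two 0s are adjacent."""
    length = p + q
    count = 0
    # Choose the positions of the 0s; combinations() yields them in increasing order.
    for zeros in combinations(range(length), p):
        if all(b - a > 1 for a, b in zip(zeros, zeros[1:])):
            count += 1
    return count

for p in range(0, 5):
    for q in range(max(p - 1, 0), 8):
        assert count_no_adjacent_zeros(p, q) == comb(q + 1, p)

print(count_no_adjacent_zeros(3, 5), comb(6, 3))
```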


We summarise the selection formulae in Table 1.1. 
Table 1.1 Summary of formulae for choosing r from n.


choose r from n           | ordered               | unordered
no repetitions allowed    | $\frac{n!}{(n-r)!}$   | $\binom{n}{r}$
repetitions allowed       | $n^r$                 | $\binom{n+r-1}{r}$


1.6 A Useful Matrix Inversion 


In this final section we present an elegant and useful matrix result which will be
of use later. It enables us to deduce, from a relation of the form $a_n = \sum_k \binom{n}{k} b_k$,
an expression for $b_n$ in terms of the $a_i$.

To begin, first consider the matrix

\[
\begin{pmatrix}
1 & 0 & 0 & 0 \\
1 & 1 & 0 & 0 \\
1 & 2 & 1 & 0 \\
1 & 3 & 3 & 1
\end{pmatrix}
\]

which is clearly constructed from the first four rows of Pascal’s triangle. Remarkably,
its inverse is

\[
\begin{pmatrix}
1 & 0 & 0 & 0 \\
-1 & 1 & 0 & 0 \\
1 & -2 & 1 & 0 \\
-1 & 3 & -3 & 1
\end{pmatrix}
\]


since the product of these two matrices is easily checked to be the identity. This 
result generalises in the obvious way. Before proving this, we need a couple of 
lemmas. 


Lemma 1.12 
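For all $i, j, k$ with $j \leq k \leq i$,

\[ \binom{i}{k}\binom{k}{j} = \binom{i}{j}\binom{i-j}{k-j}. \]


Proof

\[
\begin{aligned}
\binom{i}{k}\binom{k}{j} &= \frac{i!}{k!\,(i-k)!} \cdot \frac{k!}{j!\,(k-j)!} = \frac{i!}{(i-k)!\,j!\,(k-j)!} \\
&= \frac{i!}{j!\,(i-j)!} \cdot \frac{(i-j)!}{(i-k)!\,(k-j)!} \\
&= \binom{i}{j}\binom{i-j}{k-j}.
\end{aligned}
\]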


Lemma 1.13 


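$\sum_{j \leq k \leq i} \binom{i}{k}\binom{k}{j}(-1)^k = 0$ whenever $j < i$.


Proof

\[
\begin{aligned}
\sum_{j \leq k \leq i} \binom{i}{k}\binom{k}{j}(-1)^k &= \sum_{k=j}^{i} \binom{i}{j}\binom{i-j}{k-j}(-1)^k && \text{by Lemma 1.12} \\
&= \binom{i}{j} \sum_{k=j}^{i} \binom{i-j}{k-j}(-1)^k \\
&= \binom{i}{j} (-1)^j \sum_{\ell=0}^{i-j} \binom{i-j}{\ell}(-1)^{\ell} && \text{on putting } \ell = k - j \\
&= \binom{i}{j} (-1)^j (1-1)^{i-j} = 0 && \text{since } j < i.
\end{aligned}
\]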


We extend the definition of $\binom{i}{j}$ to cases where $i < j$ by putting $\binom{i}{j} = 0$. This
makes sense, since there is no way of choosing $j$ objects from $i < j$ without
repetitions.


Theorem 1.14 

Let $A$ be the $(n+1) \times (n+1)$ matrix, with rows and columns labelled by
$0, 1, \ldots, n$, defined by $a_{ij} = \binom{i}{j}$. Let $B$ be the $(n+1) \times (n+1)$ matrix defined
by $b_{ij} = (-1)^{i+j}\binom{i}{j}$. Then $BA = I$.


Proof


The $(i, i)$th entry of $BA$ is the product of the $i$th row of $B$ and the $i$th column
of $A$, and so is $\sum_k b_{ik} a_{ki} = \sum_k (-1)^{i+k}\binom{i}{k}\binom{k}{i}$. There is only one value of $k$
for which $\binom{i}{k}$ and $\binom{k}{i}$ are both nonzero, namely $k = i$, so the sum reduces to
$(-1)^{2i}\binom{i}{i}\binom{i}{i} = 1$. So all diagonal entries of $BA$ are 1.
If $i \neq j$, the $(i, j)$th entry of $BA$ is $\sum_k (-1)^{i+k}\binom{i}{k}\binom{k}{j} = (-1)^i \sum_k \binom{i}{k}\binom{k}{j}(-1)^k$.
This is 0, by Lemma 1.13, whenever $i > j$. But if $i < j$ then every term $\binom{i}{k}\binom{k}{j}$
is 0, so again the sum is 0.
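
Theorem 1.14 can be checked numerically for small n by forming the two matrices and multiplying them. The following is a sketch in plain Python with our own helper names; it relies on `math.comb(i, j)` returning 0 when j > i, which matches the convention adopted above.

```python
from math import comb

def pascal_matrix(n: int) -> list[list[int]]:
    """The matrix A with entries a_ij = C(i, j), 0 <= i, j <= n."""
    return [[comb(i, j) for j in range(n + 1)] for i in range(n + 1)]

def signed_pascal_matrix(n: int) -> list[list[int]]:
    """The matrix B with entries b_ij = (-1)^(i+j) C(i, j)."""
    return [[(-1) ** (i + j) * comb(i, j) for j in range(n + 1)] for i in range(n + 1)]

def multiply(X: list[list[int]], Y: list[list[int]]) -> list[list[int]]:
    """Product of two square matrices of the same size."""
    size = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(size)) for j in range(size)]
            for i in range(size)]

def identity(size: int) -> list[list[int]]:
    return [[1 if i == j else 0 for j in range(size)] for i in range(size)]

for row in signed_pascal_matrix(3):
    print(row)
print(multiply(signed_pascal_matrix(3), pascal_matrix(3)) == identity(4))
```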


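So, now that Theorem 1.14 is established, suppose we are given two sequences
$(a_0, a_1, a_2, \ldots)$ and $(b_0, b_1, b_2, \ldots)$ related by

\[ a_n = \sum_{k=0}^{n} \binom{n}{k} b_k \]

for all $n \geq 0$. We then have the matrix identity

\[ \begin{pmatrix} a_0 \\ a_1 \\ \vdots \\ a_n \end{pmatrix} = A \begin{pmatrix} b_0 \\ b_1 \\ \vdots \\ b_n \end{pmatrix} \]

so by Theorem 1.14 we can deduce that

\[ \begin{pmatrix} b_0 \\ b_1 \\ \vdots \\ b_n \end{pmatrix} = BA \begin{pmatrix} b_0 \\ b_1 \\ \vdots \\ b_n \end{pmatrix} = B \begin{pmatrix} a_0 \\ a_1 \\ \vdots \\ a_n \end{pmatrix}. \]

The bottom row then gives

\[ b_n = \sum_{k=0}^{n} (-1)^{n+k} \binom{n}{k} a_k. \]

We therefore have the following useful inversion.


Corollary 1.15
If $a_n = \sum_{k=0}^{n} \binom{n}{k} b_k$ for all $n \geq 0$, then $b_n = \sum_{k=0}^{n} (-1)^{n+k} \binom{n}{k} a_k$.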


We shall use this inversion result in Sections 2.4 and 5.3. 
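
The inversion of Corollary 1.15 can be illustrated by transforming a sequence and then recovering it. This is a minimal sketch; the function names and the sample sequence are our own and are not taken from the text.

```python
from math import comb

def binomial_transform(b: list[int]) -> list[int]:
    """a_n = sum over k of C(n, k) b_k, for n = 0, ..., len(b) - 1."""
    return [sum(comb(n, k) * b[k] for k in range(n + 1)) for n in range(len(b))]

def inverse_binomial_transform(a: list[int]) -> list[int]:
    """b_n = sum over k of (-1)^(n+k) C(n, k) a_k, as in Corollary 1.15."""
    return [sum((-1) ** (n + k) * comb(n, k) * a[k] for k in range(n + 1))
            for n in range(len(a))]

b = [3, 1, 4, 1, 5, 9, 2, 6]          # an arbitrary illustrative sequence
a = binomial_transform(b)
print(a)
print(inverse_binomial_transform(a))  # should reproduce b

# With b_k = 1 for all k, a_n is a row sum of Pascal's triangle (Theorem 1.7(i)).
print(binomial_transform([1] * 6))
```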


Exercises 


Exercise 1.1 


The 10 cabinet ministers of Newland sit around a circular table. One seat 
is reserved for the Prime Minister. In how many ways can the remaining 
nine seat themselves? 
Exercise 1.2


An 8-person committee is to be formed from a group of 15 women and 
12 men. In how many ways can the committee be chosen if 

(a) the committee must contain four men and four women? 

(b) there must be at least two men? 

(c) there must be more women than men? 


Exercise 1.3 


An eccentric, Mr. Leuben, is reported to have bet that if he shuffled
a pack of cards long enough the cards would eventually appear in a
given order. He tried for 10 hours a day for 20 years before eventually
succeeding after 4 146 028 shuffles. Was he lucky?


Exercise 1.4 


Find the probabilities of getting exactly (a) 3, (b) 4, (c) 5 numbers 
correct in the UK National Lottery. 


Exercise 1.5 


Estimate your chances of picking the winning numbers in the following 
lotteries: 


(a) Sweden - choose 7 from 35; (b) Hungary - choose 5 from 90. 


Exercise 1.6 


In the Thunderball variation of the lottery, you choose five from 1 to 
34, and one from 1 to 14. Compare your chance of winning Thunderball 
with your chances of winning the UK Lottery. 


Exercise 1.7 


What is the probability of getting exactly 5 heads in 10 tosses of a coin? 


Exercise 1.8 


Give an induction proof of the binomial theorem. (You will need to use 
Theorem 1.4(ii).) 
Exercise 1.9
Deduce from Theorem 1.7 that $\binom{n}{0} + \binom{n}{2} + \binom{n}{4} + \cdots = 2^{n-1}$.


Exercise 1.10

Use the identity $(1+x)^m (1+x)^n = (1+x)^{m+n}$ to derive the Vandermonde
identity, $\sum_{i=0}^{r} \binom{m}{i}\binom{n}{r-i} = \binom{m+n}{r}$.
Deduce that $\sum_{r=0}^{n} \binom{n}{r}\binom{n}{r+1} = \binom{2n}{n+1}$.

Exercise 1.11 

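(a) Check that $\binom{7}{0} - \binom{7}{1} + \binom{7}{2} - \binom{7}{3} = -20 = -\binom{6}{3}$.


(b) Generalise this result to $\binom{n}{0} - \binom{n}{1} + \binom{n}{2} - \cdots + (-1)^k \binom{n}{k} = (-1)^k \binom{n-1}{k}$
by replacing each $\binom{n}{r}$ by $\binom{n-1}{r} + \binom{n-1}{r-1}$.


(c) Give an alternative proof by considering the coefficient of $x^k$ on both
sides of the identity $(1+x)^{n-1} = (1+x)^n (1+x)^{-1}$.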


Exercise 1.12 
Use (1.2) to prove that $\sum_k k \binom{m}{k}\binom{n}{k} = n\binom{m+n-1}{n}$.


Exercise 1.13 


Find the number of solutions of the equation $x + y + z + w = 15$ (a) in
non-negative integers, (b) in positive integers, (c) in integers satisfying
$x > 2$, $y > -2$, $z > 0$, $w > -3$.


Exercise 1.14 


Find the number of non-negative integer solutions of $x_1 + x_2 + x_3 + x_4 \leq 6$.


Exercise 1.15 


Show that more than half of the selections of 6 from 49 in the UK Lottery 
have no two consecutive numbers. 


Exercise 1.16 


Find the number of ways of placing 4 marbles in 10 distinguishable boxes 
if 
(a) the marbles are distinguishable, and no box can hold more than one
marble;
(b) the marbles are indistinguishable, and no box can hold more than
one;
(c) the marbles are distinguishable, and each box can hold any number
of them;


Exercise 1.17

Using (1.2), prove that $\sum_{r=1}^{n} r\binom{n}{r} = n 2^{n-1}$. Why is this result obvious
when we consider the average size of a subset of a set of $n$ elements?


Exercise 1.18


Establish the formula $1^2 + 2^2 + \cdots + n^2 = \frac{1}{6} n(n+1)(2n+1)$ by the
following methods.


(a) Prove that $\binom{r+2}{3} - \binom{r}{3} = r^2$, and sum over $r$.


(b) Prove that $\binom{r}{2} + \binom{r+1}{2} = r^2$, and sum over $r$, using Theorem 1.8.


Exercise 1.19 
(For those who know de Moivre’s theorem.) 


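Let $\omega = \frac{1}{2}(-1 + i\sqrt{3})$, so that $\omega^3 = 1$ and $\omega^2 + \omega + 1 = 0$. Put $x = 1$, $y = \omega$
in the binomial theorem to show that

\[ \binom{n}{0} + \binom{n}{3} + \binom{n}{6} + \cdots = \frac{1}{3}\left( 2^n + 2\cos\frac{n\pi}{3} \right). \]

(Hint: $1 + \omega = e^{i\pi/3}$.)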


Recurrence


It often happens that, in studying a sequence of numbers $a_n$, a connection
between $a_n$ and $a_{n-1}$, or between $a_n$ and several of the previous $a_i$, $i < n$, is
obtained. This connection is called a recurrence relation; it is the aim of this 
chapter to illustrate how such recurrences arise and how they may be solved. 


2.1 Some Examples 


Example 2.1 (The towers of Hanoi) 


We begin with a problem made famous by the nineteenth century French
mathematician E. Lucas. Consider n discs, all of different sizes, with holes at their
centres (like old gramophone records), and three vertical poles onto which the 
discs can be slipped. Initially all the discs are on one of the poles, in order of 
size, with the largest at the bottom, forming a tower. It is required to move the 
discs, one at a time, finishing up with the n discs similarly arranged on one of 
the other poles. There is the important requirement that at no stage may any 
disc be placed on top of a smaller disc. What is the minimum number of moves 
required? 

Let $a_n$ denote the smallest number of moves required to move the $n$ discs.
Then clearly $a_1 = 1$. Also, $a_2 = 3$: move the top disc to one pole, the bottom
to the other, and then place the smaller on top of the larger. What about $a_n$?
It should be clear that, to be able to move the bottom disc, there has to be an
empty pole to move it to, and so all the other $n - 1$ discs must have been moved
to the third pole. To get to this stage, $a_{n-1}$ moves are needed. The largest disc
is then moved to the free pole, and then another $a_{n-1}$ moves can position the
other $n - 1$ discs on top of it. So

\[ a_n = 2a_{n-1} + 1. \]


This recurrence relation, along with the initial condition $a_1 = 1$, enables
us to find $a_n$. We have $a_2 = 2 \cdot 1 + 1 = 3$, $a_3 = 2 \cdot 3 + 1 = 7$, $a_4 = 2 \cdot 7 + 1 = 15$,
and it appears that $a_n = 2^n - 1$. This can be confirmed by induction, or by
iteration:

\[
\begin{aligned}
a_n &= 1 + 2a_{n-1} = 1 + 2(1 + 2a_{n-2}) = 1 + 2 + 2^2 a_{n-2} \\
&= 1 + 2 + 2^2 (1 + 2a_{n-3}) = 1 + 2 + 2^2 + 2^3 a_{n-3} \\
&= \cdots \\
&= 1 + 2 + 2^2 + \cdots + 2^{n-2} + 2^{n-1} a_1 \\
&= 1 + 2 + 2^2 + \cdots + 2^{n-1} \\
&= 2^n - 1.
\end{aligned}
\]


In the mythical story attached to the puzzle, n was 64 and priests had to move
discs of pure gold; when all was accomplished, the end of the world would
come. But $2^{64} - 1 = 18\,446\,744\,073\,709\,551\,615$, and at one move per second
the process would take about $5.85 \times 10^{11}$ years; so we have nothing to worry
about! This is another good example of combinatorial explosion.
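
The strategy described above (move the top n - 1 discs aside, move the largest disc, then move the n - 1 discs back on top) translates directly into a recursive procedure. The following is a sketch with our own pole labels "A", "B", "C"; it shows that the strategy uses 2^n - 1 moves, while the argument in the text is what shows that no shorter sequence exists.

```python
def hanoi_moves(n: int, source: str = "A", target: str = "C", spare: str = "B") -> list[tuple[str, str]]:
    """List the moves (from_pole, to_pole) that transfer n discs from source to target."""
    if n == 0:
        return []
    return (
        hanoi_moves(n - 1, source, spare, target)    # clear the top n-1 discs out of the way
        + [(source, target)]                         # move the largest disc
        + hanoi_moves(n - 1, spare, target, source)  # rebuild the tower on top of it
    )

print(hanoi_moves(2))
for n in range(1, 8):
    print(n, len(hanoi_moves(n)), 2 ** n - 1)
```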


Example 2.2 


There are $3^n$ $n$-digit sequences in which each digit is 0, 1 or 2. How many of
these sequences have an odd number of 0s? 


Solution 


Let $b_n$ denote the number of such sequences of length $n$ with an odd number
of 0s. Each such sequence ends in 0, 1 or 2. A sequence ending in 1 has any
of the $b_{n-1}$ sequences of length $n - 1$ preceding the 1; and similarly there are
$b_{n-1}$ sequences ending in 2. If a sequence ends in 0, the 0 must be preceded
by a sequence of length $n - 1$ with an even number of 0s; but the number of
such sequences is $3^{n-1}$ (the total number of sequences of length $n - 1$) minus
$b_{n-1}$ (the number of sequences of length $n - 1$ with an odd number of 0s); thus
there are $3^{n-1} - b_{n-1}$ sequences ending in 0. So, by the addition principle,

\[ b_n = b_{n-1} + b_{n-1} + 3^{n-1} - b_{n-1}, \quad \text{i.e.} \quad b_n = b_{n-1} + 3^{n-1}. \]

Again we can find $b_n$ by iteration:

\[
\begin{aligned}
b_n &= 3^{n-1} + b_{n-1} = 3^{n-1} + (3^{n-2} + b_{n-2}) = \cdots \\
&= 3^{n-1} + 3^{n-2} + \cdots + 3^1 + b_1.
\end{aligned}
\]

But $b_1 = 1$ (why?), so

\[ b_n = 1 + 3 + \cdots + 3^{n-1} = \frac{1}{2}(3^n - 1). \]
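
The formula for the number of sequences with an odd number of 0s can be compared with a direct count for small n. This is a brute-force sketch; the function name `count_odd_zeros` is our own.

```python
from itertools import product

def count_odd_zeros(n: int) -> int:
    """Count length-n sequences over the digits 0, 1, 2 with an odd number of 0s."""
    return sum(1 for s in product("012", repeat=n) if s.count("0") % 2 == 1)

for n in range(1, 9):
    # Compare the direct count with the closed form (3^n - 1)/2.
    print(n, count_odd_zeros(n), (3 ** n - 1) // 2)
```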
Example 2.3 (Paving a garden path)


A path is 2 metres wide and n metres long. It is to be paved using paving 
stones of size 1m x 2m. In how many ways can the paving be accomplished? 


Solution 

Let $p_n$ denote the number of pavings of a $2 \times n$ path. Clearly $p_1 = 1$ since one
paving stone fills the path. Also, $p_2 = 2$, the two possibilities being shown in
Figure 2.1(a), and $p_3 = 3$ (Figure 2.1(b)).


[Figure 2.1: (a) the two pavings of a 2 x 2 path; (b) the three pavings of a 2 x 3 path.]


It might appear that $p_n = n$ for all $n$, but check now that $p_4 = 5$. What is $p_n$?
For a 2 x n path, the paving must start with one of the options shown in 
Figure 2.2. 


Figure 2.2 


In the first case it can be completed in $p_{n-1}$ ways; in the second it can be
completed in $p_{n-2}$ ways. So, again by the addition principle,

\[ p_n = p_{n-1} + p_{n-2} \quad (n \geq 3). \]

This is a second order recurrence relation, since each $p_n$ is given in terms of
the previous two. We obtain $p_5 = 5 + 3 = 8$, $p_6 = 8 + 5 = 13$, $p_7 = 13 + 8 =
21$, etc; the sequence $(p_n)$ thus turns out to be the well-known Fibonacci
sequence $(F_n)$:

1, 2, 3, 5, 8, 13, 21, 34, 55, 89, ....


Fibonacci, or Leonardo of Pisa (c. 1200 AD) introduced this sequence when
investigating the growth of the rabbit population (see Exercise 2.5); it crops
up amazingly frequently in diverse mathematical situations. We shall obtain a
formula for $F_n$ in the next section.
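
The recurrence for the pavings can be compared with an independent count that actually places the stones. In the sketch below, which uses our own function names, `count_pavings` always covers the leftmost uncovered cell with a stone laid either along or across the path, so that each paving is counted exactly once.

```python
def count_pavings(n: int) -> int:
    """Count pavings of a 2 x n path with 1 x 2 stones by explicit placement."""
    def fill(covered: frozenset) -> int:
        free = [(col, row) for col in range(n) for row in range(2)
                if (col, row) not in covered]
        if not free:
            return 1                      # the whole path is paved
        col, row = free[0]                # leftmost uncovered cell
        total = 0
        # Stone laid along the path: covers (col, row) and (col + 1, row).
        if col + 1 < n and (col + 1, row) not in covered:
            total += fill(covered | {(col, row), (col + 1, row)})
        # Stone laid across the path: covers both cells of this column.
        if row == 0 and (col, 1) not in covered:
            total += fill(covered | {(col, 0), (col, 1)})
        return total
    return fill(frozenset())

def pavings_by_recurrence(n: int) -> int:
    """p_1 = 1, p_2 = 2, p_n = p_(n-1) + p_(n-2) for n >= 3."""
    p = [0, 1, 2]
    for k in range(3, n + 1):
        p.append(p[k - 1] + p[k - 2])
    return p[n]

print([count_pavings(n) for n in range(1, 11)])
print([pavings_by_recurrence(n) for n in range(1, 11)])
```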


Example 2.4 (Flags) 


A flag is to consist of n horizontal stripes, where each stripe can be any one 
of red, white and blue, no two adjacent stripes having the same colour. Under 
these conditions, the first (top) stripe can be any of three colours, the second 
has two possibilities, the third has two, and so on (each stripe avoiding the 
colour of the one above it); so there are $3 \times 2^{n-1}$ possible designs.

Suppose now that, in order to avoid possible confusion of flying the flag 
upside-down, it is decreed that the top and bottom stripes should be of different 


colours. Let $a_n$ denote the number of such flags with $n$ stripes. Then $a_1 = 0$


(why?) and a2 = 6. Further, since there is a one-to-one correspondence between 
flags of n stripes with bottom stripe same as top, and flags of n— 1 stripes with 
bottom stripe different from top, 


a, = 3x2"7? ~ (no. of flags with bottom colour same as top colour) 
= 3x2"? — (no. of flags of n — 1 stripes with bottom colour 
different from top). 


Thus 
Qn = 3.2") — ayy. (2.1) 
We could iterate again (try it!), but here is another method. Since 
Qn + An) = 3.2%) 


we also have 
Qn—1 + An—-2 = 3.2"? 


whence 
2(@n—1 + Qn—2) = 3.2"7) = an + Op_1. 


Thus 
An = An-1 + 24n_-2. ; (2.2) 
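As a sanity check on (2.1) and (2.2), the flags can be enumerated directly for small $n$. This is only a sketch: the function name and the colour letters R, W, B are illustrative choices, not from the text.

```python
from itertools import product


def count_flags(n: int) -> int:
    """Count n-stripe flags in red, white and blue with no two adjacent
    stripes alike and with top and bottom stripes of different colours."""
    count = 0
    for stripes in product("RWB", repeat=n):
        adjacent_ok = all(a != b for a, b in zip(stripes, stripes[1:]))
        if adjacent_ok and stripes[0] != stripes[-1]:
            count += 1
    return count
```

For n = 1 the single stripe is both top and bottom, so no flag qualifies, which is the reason behind $a_1 = 0$.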


This again is a second order recurrence relation; we now show how to solve it.


2.2 The Auxiliary Equation Method


In this section we concentrate on recurrence relations of the form

$$a_n = A a_{n-1} + B a_{n-2} \quad (n \ge 3) \qquad (2.3)$$

where $A$, $B$ are constants, $B \ne 0$, and where $a_1$ and $a_2$ are given. Equation (2.3)
is called a second order linear recurrence relation with constant coefficients; it
turns out that there is a very neat method of solving such recurrences.

First, we ask: are there any real numbers $\alpha \ne 0$ such that $a_n = \alpha^n$ satisfies
(2.3)? Substituting $a_n = \alpha^n$ into (2.3) gives $\alpha^n = A\alpha^{n-1} + B\alpha^{n-2}$, i.e. $\alpha^2 = A\alpha + B$.
Thus $a_n = \alpha^n$ is a solution of (2.3) precisely when $\alpha$ is a solution of
the auxiliary equation

$$x^2 = Ax + B. \qquad (2.4)$$

Thus if $\alpha$ and $\beta$ are distinct roots of (2.4), $a_n = \alpha^n$ and $a_n = \beta^n$ both satisfy
(2.3). If the auxiliary equation has a repeated root $\alpha$, then

$$x^2 - Ax - B = (x - \alpha)^2 = x^2 - 2\alpha x + \alpha^2,$$

so that $A = 2\alpha$ and $B = -\alpha^2$. In this case $a_n = n\alpha^n$ also satisfies (2.3), since

$$\begin{aligned} A a_{n-1} + B a_{n-2} &= A(n-1)\alpha^{n-1} + B(n-2)\alpha^{n-2} \\ &= 2(n-1)\alpha^n - (n-2)\alpha^n = n\alpha^n = a_n. \end{aligned}$$


We now prove 


Theorem 2.1 


Suppose $(a_n)$ satisfies (2.3), and that $a_1$ and $a_2$ are given. Let $\alpha$, $\beta$ be the roots
of the auxiliary equation (2.4). Then

(i) if $\alpha \ne \beta$, there are constants $K_1$, $K_2$ such that $a_n = K_1\alpha^n + K_2\beta^n$ for all
$n \ge 1$;

(ii) if $\alpha = \beta$, there are constants $K_3$, $K_4$ such that $a_n = (K_3 + nK_4)\alpha^n$ for all
$n \ge 1$.


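Proof

(i) Choose $K_1$, $K_2$ so that $a_1 = K_1\alpha + K_2\beta$, $a_2 = K_1\alpha^2 + K_2\beta^2$, i.e. take

$$K_1 = \frac{a_1\beta - a_2}{\alpha(\beta - \alpha)}, \qquad K_2 = \frac{a_1\alpha - a_2}{\beta(\alpha - \beta)}. \qquad (2.5)$$

Then the assertion that $a_n = K_1\alpha^n + K_2\beta^n$ is certainly true for $n = 1, 2$. We
now proceed by induction. Assume the assertion is true for all $n \le k$. Then

$$\begin{aligned} a_{k+1} = A a_k + B a_{k-1} &= A(K_1\alpha^k + K_2\beta^k) + B(K_1\alpha^{k-1} + K_2\beta^{k-1}) \\ &= K_1\alpha^{k-1}(A\alpha + B) + K_2\beta^{k-1}(A\beta + B) \\ &= K_1\alpha^{k+1} + K_2\beta^{k+1}, \end{aligned}$$

so the result follows.

(ii) Choose $K_3$, $K_4$ so that $a_1 = (K_3 + K_4)\alpha$, $a_2 = (K_3 + 2K_4)\alpha^2$, i.e. take

$$K_3 = \frac{2a_1\alpha - a_2}{\alpha^2}, \qquad K_4 = \frac{a_2 - a_1\alpha}{\alpha^2}. \qquad (2.6)$$

Then the assertion that $a_n = (K_3 + nK_4)\alpha^n$ is certainly true for $n = 1, 2$.
Assume it is true for all $n \le k$. Then

$$\begin{aligned} a_{k+1} = A a_k + B a_{k-1} &= A(K_3 + kK_4)\alpha^k + B(K_3 + (k-1)K_4)\alpha^{k-1} \\ &= K_3\alpha^{k-1}(A\alpha + B) + K_4\alpha^{k-1}(Ak\alpha + B(k-1)) \\ &= K_3\alpha^{k+1} + K_4\alpha^{k-1}(2k\alpha^2 - \alpha^2(k-1)) \\ &= K_3\alpha^{k+1} + K_4(k+1)\alpha^{k+1}, \end{aligned}$$

as required.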
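Theorem 2.1 translates directly into a small numerical routine. The sketch below is illustrative only: the function names are hypothetical, complex arithmetic is used so that non-real roots are also handled, and the tolerance used to decide whether the roots coincide is an assumption. The constants are those chosen in the proof above.

```python
import cmath


def solve_second_order(A, B, a1, a2, tol=1e-12):
    """Return a function n -> a_n for a_n = A*a_{n-1} + B*a_{n-2} (B != 0),
    built from the roots of the auxiliary equation x^2 = A*x + B."""
    disc = cmath.sqrt(A * A + 4 * B)
    alpha = (A + disc) / 2
    beta = (A - disc) / 2
    if abs(alpha - beta) > tol:
        # Distinct roots: a_n = K1*alpha^n + K2*beta^n.
        k1 = (a1 * beta - a2) / (alpha * (beta - alpha))
        k2 = (a1 * alpha - a2) / (beta * (alpha - beta))
        return lambda n: k1 * alpha ** n + k2 * beta ** n
    # Repeated root: a_n = (K3 + n*K4)*alpha^n.
    k3 = (2 * a1 * alpha - a2) / alpha ** 2
    k4 = (a2 - a1 * alpha) / alpha ** 2
    return lambda n: (k3 + n * k4) * alpha ** n


def iterate(A, B, a1, a2, n):
    """Compute a_n by applying the recurrence directly."""
    if n == 1:
        return a1
    prev, curr = a1, a2
    for _ in range(n - 2):
        prev, curr = curr, A * curr + B * prev
    return curr
```

For the flag recurrence (2.2), `solve_second_order(1, 2, 0, 6)` should agree with `iterate(1, 2, 0, 6, n)` up to rounding error; the returned values are complex numbers whose imaginary parts are zero when the roots are real.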
Example 2.4 (continued) 
In the flag problem we obtained the recurrence relation $a_n = a_{n-1} + 2a_{n-2}$,
where $a_1 = 0$, $a_2 = 6$. The auxiliary equation $x^2 - x - 2 = 0$ has solutions
$\alpha = -1$, $\beta = 2$, so

$$a_n = K_1(-1)^n + K_2 2^n$$

where $0 = -K_1 + 2K_2$ and $6 = K_1 + 4K_2$, i.e. $K_1 = 2$, $K_2 = 1$. So

$$a_n = 2(-1)^n + 2^n.$$


Example 2.3 (continued) 


The Fibonacci sequence $(F_n)$ is given by

$$F_1 = 1, \quad F_2 = 2, \quad F_n = F_{n-1} + F_{n-2} \quad (n \ge 3).$$

The auxiliary equation $x^2 - x - 1 = 0$ has solutions $\frac12(1 \pm \sqrt5)$, so

$$F_n = K_1\alpha^n + K_2\beta^n$$

where $\alpha = \frac12(1 + \sqrt5)$, $\beta = \frac12(1 - \sqrt5)$. The initial conditions $F_1 = 1$, $F_2 = 2$,
along with (2.5), yield $K_1 = \frac{\alpha}{\sqrt5}$, $K_2 = -\frac{\beta}{\sqrt5}$, so that

$$F_n = \frac{1}{\sqrt5}\alpha^{n+1} - \frac{1}{\sqrt5}\beta^{n+1} = \frac{1}{\sqrt5}\left(\frac{1+\sqrt5}{2}\right)^{n+1} - \frac{1}{\sqrt5}\left(\frac{1-\sqrt5}{2}\right)^{n+1}. \qquad (2.7)$$

This result may seem rather odd since $F_n$ is to be an integer. Check that
expansion by the binomial theorem leads to a cancellation of all terms involving
$\sqrt5$, giving

$$F_n = \frac{1}{2^n}\left\{\binom{n+1}{1} + 5\binom{n+1}{3} + 5^2\binom{n+1}{5} + \cdots\right\}.$$

This again is a surprise since it is by no means obvious that the sum of binomial
coefficients should be divisible by $2^n$.
Note that, since $|\beta| < 1$, the second term in (2.7) tends to 0 as $n \to \infty$, giving

$$\frac{F_{n+1}}{F_n} \to \frac{1+\sqrt5}{2},$$

the golden ratio.
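The three descriptions of $F_n$ (recurrence, closed form, binomial sum) can be compared numerically. The following is a rough sketch with illustrative function names; it uses the indexing of this chapter, $F_1 = 1$, $F_2 = 2$, and floating-point arithmetic for the closed form, so that value is only approximate.

```python
from math import comb, sqrt


def fib(n: int) -> int:
    """F_n by the recurrence, with F_1 = 1 and F_2 = 2."""
    a, b = 1, 2
    for _ in range(n - 1):
        a, b = b, a + b
    return a


def fib_closed(n: int) -> float:
    """F_n from the closed form (alpha^(n+1) - beta^(n+1)) / sqrt(5)."""
    alpha = (1 + sqrt(5)) / 2
    beta = (1 - sqrt(5)) / 2
    return (alpha ** (n + 1) - beta ** (n + 1)) / sqrt(5)


def fib_binomial_numerator(n: int) -> int:
    """The sum C(n+1,1) + 5*C(n+1,3) + 5^2*C(n+1,5) + ..., which should
    equal 2^n * F_n."""
    return sum(comb(n + 1, j) * 5 ** ((j - 1) // 2) for j in range(1, n + 2, 2))
```

Dividing `fib_binomial_numerator(n)` by `2 ** n` should leave no remainder, and `fib(n + 1) / fib(n)` should approach the golden ratio as n grows.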


Example 2.5 


Solve the recurrence relation $a_n = 4a_{n-1} - 4a_{n-2}$ $(n \ge 3)$, $a_1 = 1$, $a_2 = 3$.


Solution
The auxiliary equation is $x^2 - 4x + 4 = 0$, i.e. $(x - 2)^2 = 0$, so

$$a_n = (K_1 + nK_2)2^n.$$

The initial conditions give $1 = 2(K_1 + K_2)$, $3 = 4(K_1 + 2K_2)$, whence $K_1 = K_2 = \frac14$. Thus

$$a_n = (n + 1)2^{n-2}.$$

The auxiliary equation method extends to higher order recurrences in the obvious way.


Example 2.6 
Suppose that $a_1 = 3$, $a_2 = 6$, $a_3 = 14$ and, for $n \ge 4$,

$$a_n = 6a_{n-1} - 11a_{n-2} + 6a_{n-3}.$$

Then the auxiliary equation is $x^3 - 6x^2 + 11x - 6 = 0$, i.e. $(x - 1)(x - 2)(x - 3) = 0$,
so $a_n = K_1 + K_2 2^n + K_3 3^n$. Using the initial conditions, we get

$$a_n = 1 + 2^{n-1} + 3^{n-1}.$$


Non-homogeneous recurrence relations


The auxiliary equation method has been used for recurrence relations such as
$a_n = a_{n-1} + 2a_{n-2}$. These are homogeneous linear recurrences with constant
coefficients: $a_n$ is a linear combination of some of the previous $a_i$. We now
briefly consider the non-homogeneous case, e.g.

$$a_n = A a_{n-1} + B a_{n-2} + t_n,$$

where $t_n$ is some function of $n$. One example of this was (2.1), which we solved
by manipulating it into a second order homogeneous recurrence; but now we
give an alternative method of solution. We can obtain a solution by first
finding the solution of the recurrence relation obtained by replacing $t_n$ by 0, and
then adding to it any particular solution of the non-homogeneous recurrence.


Example 2.4 (again) 


We solve $a_n = -a_{n-1} + 3 \cdot 2^{n-1}$, $a_1 = 0$.


Solution

First we solve $a_n = -a_{n-1}$. We could use the auxiliary equation $x = -1$,
but it is easy just to spot that $a_n = (-1)^{n-1}a_1$, i.e. $a_n = K(-1)^n$. For a
particular solution of $a_n = -a_{n-1} + 3 \cdot 2^{n-1}$, we try something sensible such
as $a_n = A2^n$. Substituting gives $A2^n = -A2^{n-1} + 3 \cdot 2^{n-1}$, whence $A = 1$. So
we have $a_n = K(-1)^n + 2^n$. Since $a_1 = 0$, we need $K = 2$; so we have finally
$a_n = 2(-1)^n + 2^n$, as before.

Note that the initial conditions are not used until the final stage of the
procedure.
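The structure "general homogeneous solution plus one particular solution" can be checked numerically for the flag recurrence. This is a small sketch with hypothetical function names: every value of K gives a solution of the recurrence, and only the initial condition singles out K = 2.

```python
def homogeneous(n: int, K: int) -> int:
    """General solution K*(-1)^n of a_n = -a_{n-1}."""
    return K * (-1) ** n


def particular(n: int) -> int:
    """The particular solution 2^n of a_n = -a_{n-1} + 3*2^(n-1)."""
    return 2 ** n


def general(n: int, K: int) -> int:
    """Sum of the two, which satisfies the non-homogeneous recurrence for any K."""
    return homogeneous(n, K) + particular(n)
```

Evaluating `general(1, K)` for a few values of K shows that the condition $a_1 = 0$ is met only by K = 2.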




2.3 Generating Functions 


The generating function of a sequence $a_1, a_2, a_3, \ldots$ is defined to be

$$f(x) = \sum_{i=1}^{\infty} a_i x^i.$$

For example, the generating function of the Fibonacci sequence is

$$x + 2x^2 + 3x^3 + 5x^4 + \cdots.$$

If a sequence starts with $a_0$ we take $f(x) = \sum_{i=0}^{\infty} a_i x^i$; for example, the
generating function of the sequence $a_n = 2^n$ $(n \ge 0)$ is

$$1 + 2x + 2^2x^2 + \cdots = \frac{1}{1 - 2x}.$$

Sometimes, given a recurrence relation, it is possible to find the generating
function of the sequence and then to find $a_n$ by reading off the coefficient of
$x^n$.

Example 2.4 (yet again!) 


Consider the recurrence relation $a_n = 3 \cdot 2^{n-1} - a_{n-1}$ $(n \ge 2)$, $a_1 = 0$. Let
$f(x) = a_1x + a_2x^2 + \cdots$. Then

$$\begin{aligned} f(x) &= a_1x + (3 \cdot 2 - a_1)x^2 + (3 \cdot 2^2 - a_2)x^3 + \cdots \\ &= a_1x + 3(2x^2 + 2^2x^3 + \cdots) - (a_1x^2 + a_2x^3 + \cdots) \\ &= 0 + 6x^2(1 + 2x + 2^2x^2 + \cdots) - xf(x). \end{aligned}$$

Thus $(1 + x)f(x) = \frac{6x^2}{1 - 2x}$, so that

$$f(x) = \frac{6x^2}{(1 + x)(1 - 2x)} = 2x^2\left(\frac{2}{1 - 2x} + \frac{1}{1 + x}\right)$$

on using the method of partial fractions. Thus

$$f(x) = 4x^2(1 + 2x + 2^2x^2 + \cdots) + 2x^2(1 - x + x^2 - \cdots).$$

Reading off the coefficient of $x^n$ gives

$$a_n = 4 \cdot 2^{n-2} + 2(-1)^{n-2} = 2^n + 2(-1)^n,$$

as before.


Example 2.5 (again) 
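$a_n = 4a_{n-1} - 4a_{n-2}$ $(n \ge 3)$, $a_1 = 1$, $a_2 = 3$.

$$\begin{aligned} f(x) &= a_1x + a_2x^2 + a_3x^3 + a_4x^4 + \cdots \\ &= x + 3x^2 + (4a_2 - 4a_1)x^3 + (4a_3 - 4a_2)x^4 + \cdots \\ &= x + 3x^2 + 4(a_2x^3 + a_3x^4 + \cdots) - 4(a_1x^3 + a_2x^4 + \cdots) \\ &= x + 3x^2 + 4x(f(x) - a_1x) - 4x^2f(x), \end{aligned}$$

so that

$$(1 - 4x + 4x^2)f(x) = x + 3x^2 - 4x^2 = x - x^2.$$

Thus

$$f(x) = \frac{x - x^2}{(1 - 2x)^2}.$$

Now, since

$$\frac{1}{1 - x} = 1 + x + x^2 + \cdots,$$

differentiating gives

$$\frac{1}{(1 - x)^2} = 1 + 2x + 3x^2 + \cdots,$$

so that

$$\frac{1}{(1 - 2x)^2} = 1 + 2 \cdot 2x + 3 \cdot 2^2x^2 + \cdots.$$

Thus

$$f(x) = (x - x^2)(1 + 2 \cdot 2x + 3 \cdot 2^2x^2 + 4 \cdot 2^3x^3 + \cdots),$$

whence

$$\begin{aligned} a_n &= \text{coefficient of } x^{n-1} \text{ in } (1 + 2 \cdot 2x + \cdots) - \text{coefficient of } x^{n-2} \text{ in } (1 + 2 \cdot 2x + \cdots) \\ &= n \cdot 2^{n-1} - (n - 1)2^{n-2} = (n + 1)2^{n-2}, \end{aligned}$$

as before.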
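Reading off coefficients of a rational generating function can be mechanised. The sketch below is not from the text: the function name and the list-of-coefficients representation are assumptions, and the denominator is assumed to have constant term 1 so that all arithmetic stays in the integers. It expands numerator/denominator as a power series by matching coefficients.

```python
def series_coefficients(numerator, denominator, terms):
    """Return c_0, ..., c_{terms-1} where sum c_n x^n = numerator(x) / denominator(x).

    Polynomials are lists of integer coefficients, constant term first.
    """
    if denominator[0] != 1:
        raise ValueError("this sketch assumes a denominator with constant term 1")
    coeffs = []
    for n in range(terms):
        value = numerator[n] if n < len(numerator) else 0
        # Matching the coefficient of x^n in c(x) * q(x) = p(x) gives
        # c_n = p_n - (q_1 c_{n-1} + q_2 c_{n-2} + ...).
        for j in range(1, min(n, len(denominator) - 1) + 1):
            value -= denominator[j] * coeffs[n - j]
        coeffs.append(value)
    return coeffs
```

For the flags, $6x^2/((1+x)(1-2x))$ has denominator $1 - x - 2x^2$, so `series_coefficients([0, 0, 6], [1, -1, -2], 8)` lists $a_0$ (taken as 0) up to $a_7$; for Example 2.5 the corresponding call is `series_coefficients([0, 1, -1], [1, -4, 4], 8)`.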


2.4 Derangements 




Suppose that n people at a party leave their coats in the cloakroom. After the 
party, they each take a coat at random. How likely is it that no person gets the 


correct coat? 


A derangement of $1, \ldots, n$ is a permutation $\pi$ of $1, \ldots, n$ such that $\pi(i) \ne i$
for each $i$. For example, there are nine derangements of 1, 2, 3, 4:

2 1 4 3    3 1 4 2    4 1 2 3
2 3 4 1    3 4 1 2    4 3 1 2
2 4 1 3    3 4 2 1    4 3 2 1

In each of these 1 is not in the first place, 2 is not in the second, and so on.
Let $d_n$ denote the number of derangements of $1, \ldots, n$. Then (check!)

$$d_1 = 0, \quad d_2 = 1, \quad d_3 = 2, \quad d_4 = 9.$$

Our aim is to obtain a recurrence relation for the $d_i$ and then use it to obtain
a formula for $d_n$. Before proceeding to the recurrence relation, note that $d_n$ is
the number of ways of assigning $n$ objects to $n$ boxes, where, for each object,
there is one prohibited box, and where each box is prohibited to just one object.
Above, the objects and boxes are both labelled by $1, \ldots, n$, with box (position)
$i$ prohibited to object (number) $i$, but the labelling of the boxes and objects is
of course arbitrary and does not affect the problem.

Next note that in three of the nine derangements of 1, 2, 3, 4 listed above,
4 swaps places with another number: this happens in 2143, 3412 and 4321. In
the remaining derangements 4 does not swap places with another. With this
in mind, we put

$$d_n = e_n + f_n$$

where $e_n$, $f_n$ denote the numbers of derangements of $1, \ldots, n$ in which $n$ swaps,
does not swap, places with another. Now if $n$ swaps places with $i$ (and there are
$n - 1$ possible choices for $i$), the remaining $n - 2$ numbers have to be deranged,
and this can be done in $d_{n-2}$ ways; so

$$e_n = (n - 1)d_{n-2}.$$

If $n$ does not swap places with any other, then some $r$ goes to place $n$ (and
there are $n - 1$ choices of $r$), while $n$ does not go to place $r$. So we have to assign
places to $1, \ldots, n$, excluding $r$, where the places available are $1, \ldots, n - 1$, and
where each has precisely one forbidden place (for $i \ne r, n$, place $i$ is forbidden;
for $i = n$, place $r$ is forbidden). So there are $d_{n-1}$ possible arrangements, and
so

$$f_n = (n - 1)d_{n-1}.$$

Thus by the addition principle, we have

$$d_n = (n - 1)(d_{n-1} + d_{n-2}). \qquad (2.8)$$

Using this recurrence we get

$$d_5 = 4(9 + 2) = 44, \quad d_6 = 5(44 + 9) = 265,$$

and so on.
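The recurrence (2.8) can be compared with a direct count over all permutations. A minimal sketch follows; the function names are illustrative, and positions are numbered from 0 rather than 1 purely for convenience.

```python
from itertools import permutations


def derangements_brute(n: int) -> int:
    """Count permutations of 0..n-1 that leave no element in its own place."""
    return sum(
        all(p[i] != i for i in range(n)) for p in permutations(range(n))
    )


def derangements_recurrence(n: int) -> int:
    """d_n from d_n = (n-1)(d_{n-1} + d_{n-2}), with d_1 = 0 and d_2 = 1."""
    if n == 1:
        return 0
    prev, curr = 0, 1  # d_1, d_2
    for m in range(3, n + 1):
        prev, curr = curr, (m - 1) * (curr + prev)
    return curr
```

The brute-force count examines all n! permutations, so it is only practical for small n, whereas the recurrence needs n - 2 steps.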

The recurrence (2.8) does not permit the use of the auxiliary equation
method, since the coefficients of $d_{n-1}$ and $d_{n-2}$ are not constants. However,
we can manipulate (2.8) into a more manageable form. Equation (2.8) can be
rewritten as

$$d_n - nd_{n-1} = -(d_{n-1} - (n - 1)d_{n-2}),$$

where the expression on the right is the negative of the expression on the left,
with $n$ replaced by $n - 1$. So iteration gives

$$\begin{aligned} d_n - nd_{n-1} &= -(d_{n-1} - (n - 1)d_{n-2}) \\ &= (-1)^2(d_{n-2} - (n - 2)d_{n-3}) \\ &= \cdots \\ &= (-1)^{n-2}(d_2 - 2d_1) = (-1)^n(1 - 0) = (-1)^n, \end{aligned}$$

i.e.

$$d_n = nd_{n-1} + (-1)^n. \qquad (2.9)$$

Thus

$$\frac{d_n}{n!} - \frac{d_{n-1}}{(n-1)!} = \frac{(-1)^n}{n!}.$$

If we now sum the identities

$$\frac{d_m}{m!} - \frac{d_{m-1}}{(m-1)!} = \frac{(-1)^m}{m!}$$

over $m = 2, 3, \ldots, n$, we get cancellations on the left, giving

$$\frac{d_n}{n!} - \frac{d_1}{1!} = \frac{(-1)^2}{2!} + \frac{(-1)^3}{3!} + \cdots + \frac{(-1)^n}{n!} = \sum_{m=2}^{n}\frac{(-1)^m}{m!} = \sum_{m=0}^{n}\frac{(-1)^m}{m!}.$$

But $d_1 = 0$, so we obtain

$$d_n = n!\left(1 - \frac{1}{1!} + \frac{1}{2!} - \frac{1}{3!} + \cdots + \frac{(-1)^n}{n!}\right). \qquad (2.10)$$

One interesting consequence of (2.10) is that, as $n \to \infty$,

$$\frac{d_n}{n!} \to \frac{1}{e},$$

so the probability of no one getting their own coat back after the party tends
to $\frac{1}{e} = 0.367\,88$ as $n \to \infty$. Indeed, for $n$ as small as 6,

$$\frac{d_6}{6!} = \frac{265}{720} = 0.368\,06,$$

agreeing with $\frac{1}{e}$ to 3 decimal places.
Example 2.7 
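(a) Find the number of permutations of $1, \ldots, n$ in which exactly $k$ of the
numbers are in their correct position, and deduce that

$$n! = \sum_{k=0}^{n}\binom{n}{k}d_k. \qquad (2.11)$$

(b) What is the average number of numbers in their correct position in a random
permutation of $1, \ldots, n$?

Solution

(a) There are $\binom{n}{k}$ ways of choosing the $k$ numbers to be fixed. The remaining
$n - k$ have to be deranged, and this can be done in $d_{n-k}$ ways. So there are
$\binom{n}{k}d_{n-k}$ permutations with exactly $k$ fixed numbers.

But any of the $n!$ permutations fixes $k$ numbers for some $k$ between 0 and $n$.
So, taking $d_0 = 1$,

$$n! = \sum_{k=0}^{n}\binom{n}{k}d_{n-k} = \sum_{\ell=0}^{n}\binom{n}{n-\ell}d_\ell = \sum_{\ell=0}^{n}\binom{n}{\ell}d_\ell$$

on putting $\ell = n - k$.

(b) The average number of fixed numbers in a permutation of $1, \ldots, n$ is

$$\begin{aligned} \frac{1}{n!}\sum_{k=0}^{n}k\binom{n}{k}d_{n-k} &= \frac{1}{n!}\sum_{k=1}^{n}k\binom{n}{k}d_{n-k} \\ &= \frac{1}{n!}\sum_{k=1}^{n}n\binom{n-1}{k-1}d_{n-k} \quad \text{by (1.2)} \\ &= \frac{1}{(n-1)!}\sum_{\ell=0}^{n-1}\binom{n-1}{\ell}d_\ell \quad (\text{on putting } \ell = n - k) \\ &= \frac{1}{(n-1)!}(n-1)! \quad \text{by (2.11)} \\ &= 1. \end{aligned}$$

So the average number of fixed numbers is 1.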
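Both parts of Example 2.7 can be confirmed by enumeration for small $n$. The sketch below uses hypothetical function names and exact rational arithmetic for the average; it evaluates $d_n$ from the alternating-sum formula (2.10), with $d_0 = 1$ as the value that formula gives for $n = 0$.

```python
from fractions import Fraction
from itertools import permutations
from math import comb, factorial


def derangements_formula(n: int) -> int:
    """d_n = n! * (1 - 1/1! + 1/2! - ... + (-1)^n / n!), computed exactly."""
    return sum((-1) ** m * (factorial(n) // factorial(m)) for m in range(n + 1))


def fixed_point_counts(n: int) -> list:
    """counts[k] = number of permutations of 0..n-1 with exactly k fixed points."""
    counts = [0] * (n + 1)
    for p in permutations(range(n)):
        counts[sum(p[i] == i for i in range(n))] += 1
    return counts


def average_fixed_points(n: int) -> Fraction:
    """Average number of fixed points over all n! permutations."""
    counts = fixed_point_counts(n)
    return Fraction(sum(k * c for k, c in enumerate(counts)), factorial(n))
```

For each k, `fixed_point_counts(n)[k]` should equal `comb(n, k) * derangements_formula(n - k)`, and `average_fixed_points(n)` should be exactly 1 for every n of at least 1.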


Alternative proofs of (2.10) 


A proof of (2.10) using the inclusion-exclusion principle will be given in Chapter
6. Here we give yet another proof, a simple application of the inversion principle,
as in Corollary 1.15, applied to (2.11).

In (2.11), put $a_n$ for $n!$ and $b_n$ for $d_n$. Then (2.11) is

$$a_n = \sum_{k=0}^{n}\binom{n}{k}b_k,$$

so that, by Corollary 1.15,

$$\begin{aligned} d_n = b_n &= \sum_{k=0}^{n}(-1)^{n-k}\binom{n}{k}a_k = \sum_{k=0}^{n}(-1)^{n-k}\frac{n!}{(n-k)!} \\ &= n!\sum_{\ell=0}^{n}\frac{(-1)^\ell}{\ell!} \quad (\text{on putting } \ell = n - k). \end{aligned}$$


2.5 Sorting Algorithms


Given a pile of exam scripts, we might want to sort them, i.e. put them in
increasing or decreasing order of marks. Are there any efficient ways of doing
this? We start with a simple but not very efficient procedure.


Bubblesort 


Take a list of $n$ numbers, in random order. Compare the first two, swapping
them round if they are not in increasing order. Then compare the second and
third numbers, again swapping if necessary. In this way proceed up the sequence;
the largest number will then be at the end. Next repeat the whole
process for the first $n - 1$ numbers: this will take the second largest to the
second last position. Repeat for the first $n - 2$, and so on.

The total number of comparisons involved in this procedure is

$$(n - 1) + (n - 2) + \cdots + 2 + 1 = \tfrac12 n(n - 1) = \binom{n}{2},$$

so we say that the bubblesort algorithm has $O(n^2)$ complexity.


Example 2.8 


Start with 7, 10, 4, 6, 3.
After the first 4 comparisons we have 7, 4, 6, 3, 10.
After the next 3 comparisons we have 4, 6, 3, 7, 10.
After the next 2, we have 4, 3, 6, 7, 10.
After the final comparison we have 3, 4, 6, 7, 10.
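The procedure described above can be sketched in a few lines. The function name and the decision to return the comparison count and a snapshot after each pass are illustrative choices, made so that the output can be compared with Example 2.8.

```python
def bubblesort(items):
    """Sort a copy of items into increasing order.

    Returns (sorted_list, number_of_comparisons, snapshots), where snapshots
    holds the state of the list after each pass.
    """
    data = list(items)
    comparisons = 0
    snapshots = []
    # After each pass the largest remaining number sits at position `end`.
    for end in range(len(data) - 1, 0, -1):
        for i in range(end):
            comparisons += 1
            if data[i] > data[i + 1]:
                data[i], data[i + 1] = data[i + 1], data[i]
        snapshots.append(list(data))
    return data, comparisons, snapshots
```

Calling `bubblesort([7, 10, 4, 6, 3])` should reproduce the four intermediate lists of Example 2.8 and use 4 + 3 + 2 + 1 = 10 comparisons.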


Mergesort 


The idea here is to split the given list into two (roughly) equal parts, sort each
separately, and then merge (combine) them.

The process of combining two sorted lists of lengths $\ell$ and $m$ into one list can
be accomplished by at most $\ell + m - 1$ comparisons. For suppose we have two such lists,
both in increasing order. Compare the first (smallest) numbers in the lists, and
take the smaller as the first member of a new list $L$, crossing it out of its original
position. Repeat the process to find the second member of $L$, and so on. The
number of comparisons is clearly at most $\ell + m - 1$, since when only one number from
the two original lists is left no comparison is necessary.

Before the merging takes place, the two halves of the original list can be
sorted by a similar method. Let $t_n$ denote the number of comparisons needed
to sort a list of $n$ members by this method. If we split $n$ into $\ell + k$, then
$t_n = t_\ell + t_k + \ell + k - 1 = t_\ell + t_k + n - 1$.

Thus, if we consider the particular case where $n = 2^m$, so that the lists can
be bisected at each stage, we have

$$t_{2^m} = 2t_{2^{m-1}} + (2^m - 1).$$

Put $a_m = t_{2^m}$; then the recurrence relation becomes

$$a_m = 2a_{m-1} + (2^m - 1). \qquad (2.12)$$

Using the method of Section 2.2, first solve the homogeneous recurrence $a_m = 2a_{m-1}$.
The solution is clearly $a_m = A2^m$ for some constant $A$. We then have
to find a particular solution of (2.12). Try

$$a_m = Bm2^m + C.$$

(Trying $a_m = B \cdot 2^m + C$ would not work, since $a_m = 2^m$ is already a solution of
the homogeneous recurrence; so we take the hint given by Theorem 2.1(ii) and
insert $m$.) We then require

$$Bm2^m + C = 2B(m - 1)2^{m-1} + 2C + 2^m - 1,$$

i.e.

$$0 = -B \cdot 2^m + 2^m - 1 + C.$$

So take $B = C = 1$ to obtain finally $a_m = A \cdot 2^m + m2^m + 1$. But $a_1 = 1$, so
$A = -1$, giving

$$a_m = 2^m(m - 1) + 1.$$

Thus $t_{2^m} = 1 + 2^m(m - 1)$. On putting $n = 2^m$, we get

$$t_n = 1 + n(\log_2 n - 1),$$

so the mergesort method has complexity $O(n \log n)$, an improvement on the
$O(n^2)$ of bubblesort.
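A short sketch of mergesort with a comparison counter is given below; the function names are illustrative. Note one assumption made explicit here: a particular merge may finish early and use fewer than $\ell + m - 1$ comparisons, so the count $t_n$ derived above is an upper bound on what the code reports, and `worst_case_comparisons` simply iterates (2.12).

```python
def merge(left, right):
    """Merge two increasing lists; return (merged_list, comparisons_used)."""
    merged, comparisons = [], 0
    i = j = 0
    while i < len(left) and j < len(right):
        comparisons += 1
        if left[i] <= right[j]:
            merged.append(left[i])
            i += 1
        else:
            merged.append(right[j])
            j += 1
    # One list is exhausted: the rest is appended without further comparisons.
    merged.extend(left[i:])
    merged.extend(right[j:])
    return merged, comparisons


def mergesort(items):
    """Sort items by splitting, sorting each half and merging."""
    items = list(items)
    if len(items) <= 1:
        return items, 0
    mid = len(items) // 2
    left, c_left = mergesort(items[:mid])
    right, c_right = mergesort(items[mid:])
    merged, c_merge = merge(left, right)
    return merged, c_left + c_right + c_merge


def worst_case_comparisons(m: int) -> int:
    """a_m from a_m = 2*a_{m-1} + 2^m - 1 with a_1 = 1 (lists of length 2^m)."""
    a = 1
    for k in range(2, m + 1):
        a = 2 * a + 2 ** k - 1
    return a
```

For a list of 16 numbers the bound is 1 + 16(4 - 1) = 49 comparisons, against 120 for bubblesort.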
2.6 Catalan Numbers


In this section we introduce a well-known sequence of numbers known as the
Catalan numbers, which arise as the counting numbers of a remarkable number
of different types of structure. They are named after the Belgian mathematician
E.C. Catalan (1814-1894) who discussed them in his publications, but they had
been studied earlier by several mathematicians, including Euler in his work on
triangulating polygons (to be discussed shortly).

We describe fully one of the occurrences of Catalan numbers, and begin with
the following easy problem.


Example 2.9 
How many "up-right" routes are there from A(0, 0) to B(5, 3) in Figure 2.3?


Figure 2.3


Solution
By an "up-right" route we mean a path from A to B following edges of squares,
always moving up or to the right. Any path must consist of 8 moves, 5 of which
must be to the right, and 3 up. So the total number of possible routes is $\binom{8}{3}$.

More generally, the number of up-right routes from the bottom left vertex to
the top right vertex of an $m \times n$ array is $\binom{m+n}{m}$.

Suppose we now have a square $n \times n$ array, and ask for the number $p_n$
of up-right paths from bottom left to top right which never go above the
diagonal AB. In the case $n = 3$, shown in Figure 2.4, there are 5 such routes,
represented by RURURU, RURRUU, RRUURU, RRURUU, RRRUUU where
R, U stand respectively for right, up. Thus $p_3 = 5$. What is $p_n$?


Figure 2.4


Any qualifying route (let's call it a good route) from A to B must "hit"
the diagonal at some stage before B, even if it is only at A. So consider any
good route from A to B, and suppose that, prior to reaching B, it last met
the diagonal at the point C(m, m) where $0 \le m < n$. Then there are $p_m$
possibilities for the part of the route between A and C. The route must then
proceed to D(m + 1, m), and eventually to E(n, n - 1), but it must never go
above the line DE. Since D and E are opposite corners of a square array of side
$n - m - 1$, there are $p_{n-m-1}$ good routes from D to E. See Figure 2.5.


Figure 2.5


By the multiplication principle, the number of good routes from A to B, with
(m, m) as the last contact with the diagonal before B, is therefore $p_m p_{n-m-1}$.
Since $m$ can take any value from 0 to $n - 1$, it now follows from the addition
principle that, with $p_0 = 1$,

$$p_n = \sum_{m=0}^{n-1} p_m p_{n-m-1}. \qquad (2.13)$$
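This recurrence relation differs from the ones met so far, but we can use
generating functions to solve it. Let $f(x)$ be the generating function

$$f(x) = p_0 + p_1x + p_2x^2 + \cdots.$$

Then

$$\begin{aligned} f^2(x) &= (p_0 + p_1x + p_2x^2 + \cdots)(p_0 + p_1x + p_2x^2 + \cdots) \\ &= \sum_{n=0}^{\infty} x^n(p_0p_n + p_1p_{n-1} + \cdots + p_np_0) \\ &= \sum_{n=0}^{\infty} p_{n+1}x^n \quad \text{by (2.13)}. \end{aligned}$$

Thus $xf^2(x) = \sum_{n=0}^{\infty} p_{n+1}x^{n+1} = f(x) - 1$, whence

$$xf^2(x) - f(x) + 1 = 0.$$

Solving this quadratic, we obtain

$$f(x) = \frac{1 \pm \sqrt{1 - 4x}}{2x} = \frac{1}{2x}\left\{1 \pm (1 - 4x)^{1/2}\right\}.$$

We have to take the negative sign to avoid having a term of the form $\frac{1}{x}$ in
$f(x)$. So

$$\begin{aligned} f(x) &= \frac{1}{2x}\left\{1 - (1 - 4x)^{1/2}\right\} \\ &= \frac{1}{2x}\left\{\frac12(4x) + \frac12 \cdot \frac12 \cdot \frac{(4x)^2}{2!} + \frac12 \cdot \frac12 \cdot \frac32 \cdot \frac{(4x)^3}{3!} + \frac12 \cdot \frac12 \cdot \frac32 \cdot \frac52 \cdot \frac{(4x)^4}{4!} + \cdots\right\} \\ &= 1 + \frac12 \cdot \frac{4x}{2!} + \frac{1 \cdot 3}{2 \cdot 2} \cdot \frac{(4x)^2}{3!} + \frac{1 \cdot 3 \cdot 5}{2 \cdot 2 \cdot 2} \cdot \frac{(4x)^3}{4!} + \cdots. \end{aligned}$$

Thus, for $n \ge 1$,

$$p_n = \frac{1 \cdot 3 \cdot 5 \cdots (2n - 1)}{(n + 1)!}\,2^n = \frac{2^n}{(n + 1)!} \cdot \frac{(2n)!}{2^n\,n!} = \frac{1}{n + 1}\binom{2n}{n}.$$

Thus, for example, $p_3 = \frac14\binom{6}{3} = 5$ and $p_4 = \frac15\binom{8}{4} = 14$. Note also that $p_0 = 1$
fits in with the convention that $\binom{0}{0} = 1$.

The numbers $p_n$ are the Catalan numbers, usually denoted by $C_n$. Thus

$$C_n = \frac{1}{n + 1}\binom{2n}{n}. \qquad (2.14)$$

The sequence $(C_n)_{n \ge 0}$ begins

1, 1, 2, 5, 14, 42, 132, 429, ....

From (2.13) we have

$$C_m = C_0C_{m-1} + C_1C_{m-2} + \cdots + C_{m-1}C_0. \qquad (2.15)$$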
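The three descriptions of the Catalan numbers (good routes, the recurrence (2.13), and the closed form (2.14)) can be checked against one another for small $n$. This is a sketch with illustrative function names; a route is assumed to be encoded by the set of step positions at which it moves right.

```python
from itertools import combinations
from math import comb


def good_routes_brute(n: int) -> int:
    """Count up-right routes from (0, 0) to (n, n) that never go above the diagonal."""
    count = 0
    for right_positions in combinations(range(2 * n), n):
        rights = set(right_positions)
        x = y = 0
        ok = True
        for step in range(2 * n):
            if step in rights:
                x += 1
            else:
                y += 1
            if y > x:  # the route has gone above the diagonal
                ok = False
                break
        count += ok
    return count


def catalan_recurrence(n: int) -> int:
    """p_n from p_n = sum of p_m * p_{n-m-1} over m = 0..n-1, with p_0 = 1."""
    p = [1]
    for k in range(1, n + 1):
        p.append(sum(p[m] * p[k - 1 - m] for m in range(k)))
    return p[n]


def catalan_formula(n: int) -> int:
    """C_n = C(2n, n) / (n + 1)."""
    return comb(2 * n, n) // (n + 1)
```

The brute-force count runs through all $\binom{2n}{n}$ routes, so it is only suitable for small n.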


As remarked earlier, the Catalan numbers appear in many situations. One
immediate interpretation, obtained by replacing R and U by 0 and 1 respectively,
is:

$C_n$ = number of binary sequences of length $2n$ containing exactly $n$ 0s and $n$
1s, such that at each stage in the sequence the number of 1s up to that point
never exceeds the number of 0s.


Euler's interest was in the following:

$C_{n-2}$ = number of ways of dividing a convex $n$-gon into triangles by drawing
$n - 3$ non-intersecting diagonals. For example, the $C_3$ ways of triangulating a
pentagon are shown in Figure 2.6.


Figure 2.6


See Exercise 2.16 for this problem and Exercise 2.17 for another appearance
of $C_n$.


Another derivation of the formula (2.14) 


We close this section by pointing out that there is an alternative ingenious 
method of counting good up-right routes, due to D. André (1887). It avoids 
the rather awkward recurrence relation (2.13), instead making use of a clever 
mirror principle. 

The number of good routes from A(0, 0) to B(n, n) which do not cross the
diagonal AB is the total number $\binom{2n}{n}$ of up-right routes from A to B minus
the number of routes which do cross AB. Let's call routes which cross AB bad
routes.


Figure 2.7


Consider any bad route. There will be a first point on that route above the
diagonal AB; suppose this is the point R(m, m + 1). If we replace the part of
the route from A to R by its image in the "mirror" GF (see Figure 2.7) then
we get an up-right route from H(-1, 1) to B(n, n). Conversely, any up-right
route from H to B must cross GF somewhere, and arises from precisely one
bad route from A to B. So the number of bad routes is just the number of
up-right routes from (-1, 1) to (n, n), which is

$$\binom{(n + 1) + (n - 1)}{n + 1} = \binom{2n}{n + 1}.$$

So finally the number of good routes from A to B is

$$\binom{2n}{n} - \binom{2n}{n + 1} = \binom{2n}{n} - \frac{n}{n + 1}\binom{2n}{n} = \frac{1}{n + 1}\binom{2n}{n}.$$
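The count of bad routes obtained from the mirror argument can be confirmed by enumeration for small $n$. The sketch below uses an illustrative function name and the same encoding of routes by the positions of their right-moves.

```python
from itertools import combinations
from math import comb


def bad_routes_brute(n: int) -> int:
    """Count up-right routes from (0, 0) to (n, n) that go above the diagonal."""
    bad = 0
    for right_positions in combinations(range(2 * n), n):
        rights = set(right_positions)
        x = y = 0
        for step in range(2 * n):
            if step in rights:
                x += 1
            else:
                y += 1
            if y > x:  # first point above the diagonal, of the form (m, m + 1)
                bad += 1
                break
    return bad
```

For each n the result should equal `comb(2 * n, n + 1)`, and subtracting it from `comb(2 * n, n)` should leave the Catalan number.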


Exercises 


Exercise 2.1 


Solve the recurrence relations

(a) $a_n = \frac12 a_{n-1} + 1$, $a_1 = 1$;
(b) $a_n = 5a_{n-1} - 6a_{n-2}$, $a_1 = -1$, $a_2 = 1$;
(c) $a_n = 6a_{n-1} - 9a_{n-2}$, $a_1 = 1$, $a_2 = 9$;
(d) $a_n = 4a_{n-1} - 3a_{n-2} + 2^n$, $a_1 = 1$, $a_2 = 11$.


Exercise 2.2 


Let b,, denote the 


Exercise 2.3 


Let $d_n$ denote the number of $n$-digit sequences in which each digit is 0, 1
or 2, and containing no two consecutive 1s and no two consecutive 2s.
Show that $d_n = 2d_{n-1} + d_{n-2}$. Solve this recurrence and deduce that
$d_n = 1 + 2\binom{n+1}{2} + 2^2\binom{n+1}{4} + 2^3\binom{n+1}{6} + \cdots$.


Exercise 2.4 


Use generating functions to solve Exercise 2.1(a) and 2.1(b). 


Exercise 2.5 


Fibonacci's rabbits. Start with 1 pair of rabbits, and suppose that each
pair produces one new pair in each of the next two generations and
then dies. Find $f_n$, the number of pairs belonging to the $n$th generation
($f_1 = 1 = f_2$).


Exercise 2.6 
Solve the recurrence relation (2.1) for the flags by iteration. 
Exercise 2.7 


The Lucas numbers $L_n$ are defined by $L_1 = 1$, $L_2 = 3$, $L_n = L_{n-1} + L_{n-2}$
$(n \ge 3)$. Obtain a formula for $L_n$.


Exercise 2.8 


Solve the recurrence (2.12) by using the method given in Example 2.4,
first eliminating 1 and then eliminating powers of 2. You should obtain
$a_n - 5a_{n-1} + 8a_{n-2} - 4a_{n-3} = 0$.


Exercise 2.9


Verify that if $a'_n$ and $a''_n$ are two solutions of the recurrence $a_n = Aa_{n-1} + Ba_{n-2}$
then $a'_n + a''_n$ is also a solution.


Exercise 2.10


Show that the generating function for the Fibonacci sequence is $\frac{x + x^2}{1 - x - x^2}$.
Hence obtain (2.7).


Exercise 2.11 


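Let $M = \begin{pmatrix} 1 & 1 \\ 1 & 0 \end{pmatrix}$.

(a) Prove that $M^{n+2} = \begin{pmatrix} F_{n+2} & F_{n+1} \\ F_{n+1} & F_n \end{pmatrix}$ where $F_n$ is the $n$th Fibonacci number.

(b) By taking determinants show that $F_nF_{n+2} - F_{n+1}^2 = (-1)^n$.

(c) By considering the identity $M^{m+n+2} = M^{m+1}M^{n+1}$ prove that
$F_{m+n} = F_mF_n + F_{m-1}F_{n-1}$.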


Exercise 2.12 
Prove that $F_1 + F_2 + \cdots + F_n = F_{n+2} - 2$.


Exercise 2.13 


For each of the following, work out the values for the first few values 
of n and make a guess at the general case. Then prove your guesses by 
induction. 

(a) $F_1 + F_3 + F_5 + \cdots + F_{2n-1}$;

(b) $F_2 + F_4 + F_6 + \cdots + F_{2n}$;

(c) $F_1 - F_2 + F_3 - \cdots + (-1)^{n-1}F_n$.


Exercise 2.14 


In bellringing, successive permutations of $n$ bells are played one after
the other. Following one permutation $\pi$, the next permutation must be
obtained from $\pi$ by moving the position of each bell by at most one
place. For example, for $n = 4$, the permutation 1234 could be followed
by any one of 2134, 2143, 1324, 1243. Show that if $a_n$ denotes the number
of permutations which could follow $12 \ldots n$, then $a_n = a_{n-1} + a_{n-2} + 1$.
Hence find $a_n$.


Exercise 2.15


(a) Let $g_n$ denote the number of subsets of $\{1, \ldots, n\}$ containing no two
consecutive integers. Thus, for example, $g_1 = 2$ (include the empty
set!) and $g_2 = 3$. Find a recurrence relation for $g_n$, and deduce that
$g_n = F_{n+1}$.

(b) A $k$-element subset of $\{1, \ldots, n\}$ can be considered as a binary
sequence of length $n$ containing $k$ 1s and $n - k$ 0s.
Use Example 1.17 to show that the number of $k$-subsets of $\{1, \ldots, n\}$
containing no two consecutive integers is $\binom{n-k+1}{k}$.

(c) Deduce that $F_n = \sum_{k \le \frac12 n}\binom{n-k}{k}$. How does this relation show up in
Pascal's triangle?


Exercise 2.16 


Let $t_n$ denote the number of ways of triangulating a convex $(n + 2)$-gon
by drawing $n - 1$ diagonals. Show that $t_n = C_n$ as follows. Label the
vertices $1, \ldots, n + 2$, and consider the triangle containing edge 12. If it
contains vertex $r$ as its third vertex, in how many ways can the remaining
two parts of the interior of the $(n + 2)$-gon be triangulated? Deduce that
$t_n = \sum t_it_j$ where summation is over all pairs $i, j$ with $i + j = n - 1$.


Exercise 2.17 


Show that if $2n$ points are marked on the circumference of a circle and if
$a_n$ is the number of ways of joining them in pairs by $n$ non-intersecting
chords, then $a_n = C_n$.


Exercise 2.18 
Derive Euler's formula $C_n = \frac{2 \cdot 6 \cdot 10 \cdots (4n - 2)}{(n + 1)!}$ for the Catalan numbers, and
note that $(n + 1)C_n = (4n - 2)C_{n-1}$.


Exercise 2.19
Prove that $d_n > (n - 1)!$ for all $n \ge 4$.


Exercise 2.20 


Insertionsort. Sort a list $x_1, \ldots, x_n$ into increasing order as follows. At
stage 1, form list $L_1$ consisting of just $x_1$. At stage 2, compare $x_1$ with
$x_2$ and form list $L_2$ consisting of $x_1$ and $x_2$ in increasing order. At stage
$i$, when $x_1, \ldots, x_{i-1}$ have been put into list $L_{i-1}$ in increasing order,
compare $x_i$ with each $x_j$ in $L_{i-1}$ in turn until its correct position is
obtained; this creates list $L_i$. Repeat until $L_n$ is obtained. Compare the
efficiency of this method with that of bubblesort.


Exercise 2.21 


In a mathematical model of the population of foxes and rabbits, the
populations $x_n$ and $y_n$ of foxes and rabbits at the end of $n$ years are
related by

$$\begin{pmatrix} x_{n+1} \\ y_{n+1} \end{pmatrix} = \begin{pmatrix} 0.6 & 0.5 \\ -0.16 & 1.2 \end{pmatrix}\begin{pmatrix} x_n \\ y_n \end{pmatrix}.$$

Show that $5x_{n+2} - 9x_{n+1} + 4x_n = 0$, and hence find $x_n$ in terms of $x_0$
and $y_0$.

Deduce that $x_n \to 2.5y_0 - x_0$ as $n \to \infty$, provided $x_0 < 2.5y_0$. What
happens to $y_n$?


Exercise 2.22 


In a football competition, there are n qualifying leagues. At the next 
stage of the competition, each winner of a league plays a runner up in 
another league. In how many ways can the winners and the runners up 
be paired?


Introduction to Graphs


We introduce the idea of a graph via some examples, and concentrate on two 
types of graph, namely trees and planar graphs. Further graph-theoretic topics 
will be covered in the next chapter. 


3.1 The Concept of a Graph 


Example 3.1 (The seven bridges of Königsberg)


In the early eighteenth century there were seven bridges over the River Pregel
in the Eastern Prussian town of Königsberg (now Kaliningrad). It is said that
the residents tried to set out from home, cross every bridge exactly once and
return home. They began to believe the task was impossible, so they asked
Euler if it were possible. Euler's proof that it was impossible is often taken to
be the beginning of the theory of graphs. What Euler essentially did (although
his argument was in words rather than pictures) was to reduce the complexity
of Figure 3.1(a) to the simple diagram of Figure 3.1(b), where each land mass is
represented by a point (vertex) and each bridge by a line (edge). If the desired
walk existed, then each time a vertex was visited by using one edge,
another edge would be used up leaving the vertex; so every vertex would have
to have an even number of edges incident with it. Since this is not the case, the
desired walk is impossible.

The diagram of Figure 3.1(b) is an example of a graph. It has four vertices
and seven edges.


An old problem concerns three houses A, B, C which are to be joined to each
of the three utilities, gas, water and electricity, without any two connections
crossing each other. In other words, can the diagram of Figure 3.2 be redrawn
so that no two lines cross? The diagram is another example of a graph.


Figure 3.2 The utilities graph (houses A, B, C; utilities G, W, E)


Definition 3.1 


A graph $G$ consists of a finite set $V$ of vertices and a collection $E$ of pairs of
vertices called edges. The vertices are represented by points, and the edges by
lines (not necessarily straight) joining pairs of points. If an edge $e$ joins vertices
$x$ and $y$ then $x$ and $y$ are adjacent and $e$ is incident with both $x$ and $y$. Any
edge joining a vertex $x$ to itself is called a loop.

Note that we say $E$ is a collection of pairs, not a set of pairs. This is to
allow repeated edges. If two or more edges join the same two vertices, they are
called multiple edges. For example, the graph of Figure 3.1(b) has two pairs
of multiple edges. The graph of the utilities problem is simple, i.e. it has no
loops or multiple edges.

The number of edges incident with a vertex $v$ in a graph without loops
is called the degree or valency of $v$ and is denoted by $d(v)$. The second
name recalls one of the early occurrences of graphs, as drawings of chemical
molecules. For example, ethane ($\mathrm{C_2H_6}$) can be represented by the graph of
Figure 3.3, where the two "inside" vertices, of valency 4, represent the two
carbon atoms (carbon has valency 4), and the six other vertices, of valency
1, represent hydrogen atoms. Vertices of degree 1 are called pendant or end
vertices.


Figure 3.3 Ethane


When a graph contains a loop, the loop is considered to contribute twice to 
the degree of its incident vertex. This convention enables us to establish the 
following useful result. 


Theorem 3.1 


The sum of the degrees of the vertices of a graph is twice the number of edges. 


Proof 
Each edge contributes twice to the sum of the degrees, once at each end. 


This result is sometimes called the handshaking lemma: at a party, the total 
number of hands shaken is twice the number of handshakes. It has an immediate 
corollary. 


Corollary 3.2 


In any graph, the sum of the vertex degrees is even. 
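The handshaking lemma is easy to test on the two graphs met so far. In the sketch below a graph is assumed to be stored as a list of vertex pairs (so that multiple edges and loops are allowed); the function name and the vertex labels N, S, K, D chosen for the four land masses of Königsberg are hypothetical, since Figure 3.1 is not reproduced here.

```python
from collections import Counter


def degrees(edges):
    """Return a Counter mapping each vertex to its degree.

    Each edge adds one to the degree of each of its ends, so a loop
    (an edge joining a vertex to itself) contributes twice.
    """
    degree = Counter()
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1
    return degree


# The seven bridges: two banks (N, S), the central island (K), the eastern land mass (D).
konigsberg = [("N", "K"), ("N", "K"), ("S", "K"), ("S", "K"),
              ("N", "D"), ("S", "D"), ("K", "D")]

# The utilities graph: each house joined to each of gas, water, electricity.
utilities = [(house, utility) for house in "ABC" for utility in "GWE"]
```

In both cases `sum(degrees(edges).values())` should equal twice the number of edges; every vertex of the Königsberg graph has odd degree, which is why the desired walk cannot exist.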


Example 3.3 


The complete graph $K_n$ is the simple graph with $n$ vertices, in which each
pair of vertices are adjacent. Since each of the $n$ vertices must have degree $n - 1$,
the number $q$ of edges must satisfy $2q = n(n - 1)$, so that $q = \frac12 n(n - 1)$. This
of course is as expected, since $q$ is just the number of ways of choosing two of
the $n$ vertices, i.e. $q = \binom{n}{2} = \frac12 n(n - 1)$.

The graphs $K_n$, $n \le 4$, are shown in Figure 3.4. The notation $K_n$ is in honour
of the Polish mathematician K. Kuratowski (1896-1980) whose important
theorem on planarity will be mentioned in Section 3.6. Note that $K_4$ contains
$K_3$ within it; this idea of one graph being contained in another is formalised in
the next definition.


Definition 3.2 


A graph H is said to be a subgraph of a graph G if the vertex set of H is a
subset of the vertex set of G, and the edge set of H is a subset of the edge set
of G.

Thus, for example, $K_m$ is a subgraph of $K_n$ whenever m < n; simply restrict
$K_n$ to m of its vertices.

Finally in this section, we establish some standard notation. From now on, 
we shall use p and q to denote the numbers of vertices and edges respectively, 
and by a (p,q)-graph we shall mean a graph with p vertices and q edges. Thus,
for example, $K_4$ is a (4,6)-graph.


3.2 Paths in Graphs 


Many important applications of graph theory involve travelling round the 
graph, in the sense of moving from vertex to vertex along incident edges. We 
make some definitions related to this idea. 


Definition 3.3 
A walk in a graph G is a sequence of edges of the form
$v_0v_1, v_1v_2, v_2v_3, \ldots, v_{n-1}v_n$.

This walk is sometimes, in a simple graph, represented more compactly by
$v_0 \to v_1 \to v_2 \to \cdots \to v_n$. Note that there is an implied direction to the walk.
$v_0$ is called the initial vertex and $v_n$ the final vertex of the walk; the number
(n) of edges is called the length of the walk.

A walk in which all the edges are distinct is called a trail. A trail in which
all vertices $v_0, \ldots, v_n$ are distinct (except possibly $v_0 = v_n$) is called a path; a
path $v_0 \to \cdots \to v_n$ with $v_n = v_0$ is called a cycle.


Example 3.4 

In the graph of Figure 3.5,
$z \to u \to y \to v \to u$ is a trail but not a path;
$u \to y \to w \to v$ is a path of length 3;
$u \to y \to w \to v \to u$ is a cycle of length 4.

Figure 3.5

It seems natural to consider the cycles $u \to y \to w \to v \to u$ and $y \to w \to v \to u \to y$
to be the same; so often we identify a cycle with the set of its edges. We use
the notation (for n > 1) 


$C_n$ = cycle of length n (i.e. with n edges and vertices);
$P_n$ = path of length n - 1 (i.e. with n vertices).

Thus, for example, P2 = K2 and C3 = K3. 

Definition 3.4


A graph is connected if, for each pair x, y of vertices, there is a path from
x to y. A graph which is not connected is made up of a number of connected
pieces, called components. 
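
Definition 3.4 can be turned into a search procedure: starting from any vertex, collect everything reachable from it, and repeat from a vertex not yet reached. The sketch below is one possible way to do this in Python using breadth-first search; the names `components` and `is_connected` and the sample graph are assumptions chosen for this illustration.

```python
from collections import deque

def components(vertices, edges):
    """Return the components of a graph as a list of vertex sets."""
    adj = {v: set() for v in vertices}
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    seen, comps = set(), []
    for start in vertices:
        if start in seen:
            continue
        # Breadth-first search collects every vertex joined to `start` by a path.
        comp, queue = {start}, deque([start])
        seen.add(start)
        while queue:
            x = queue.popleft()
            for y in adj[x]:
                if y not in seen:
                    seen.add(y)
                    comp.add(y)
                    queue.append(y)
        comps.append(comp)
    return comps

def is_connected(vertices, edges):
    return len(components(vertices, edges)) == 1

# Hypothetical example: a path on vertices 1..4 together with a triangle on 5, 6, 7.
V = [1, 2, 3, 4, 5, 6, 7]
E = [(1, 2), (2, 3), (3, 4), (5, 6), (6, 7), (7, 5)]
print(components(V, E))
print(is_connected(V, E))
```

The sample graph has two components, so it is not connected.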


3.3 Trees 


Definition 3.5 


A tree is a connected simple graph with no cycles. 

For example, the ethane graph in Figure 3.3 is a tree, as is each $P_n$. Note that
the ethane graph has p = 8 and q = 7, while $P_n$ has p = n and q = n - 1; in
each case, p - q = 1. This property in fact characterises those connected graphs
which are trees. Our proof of this depends upon the following useful result. 


Theorem 3.3 
If T is a tree with $p \ge 2$ vertices then T contains at least two pendant vertices.

Proof

Since T has p vertices, all paths in T must have length less than p. So there
must be a longest path in T, say $v_1 \to v_2 \to \cdots \to v_r$. We claim that $v_1$ and
$v_r$ both have degree 1. Suppose $v_1$ has degree > 1; then there is another edge
from $v_1$, say $v_1v_0$, where $v_0$ is none of $v_2, \ldots, v_r$ (otherwise there would be a
cycle), so $v_0 \to v_1 \to \cdots \to v_r$ would be a longer path. So $v_1$ has degree 1, and
a similar argument holds for $v_r$.


Theorem 3.4 
Let T be a simple graph with p vertices. Then the following statements are
equivalent:


(i) T is a tree;


(ii) T has p - 1 edges and no cycles;


(iii) T has p - 1 edges and is connected.


Proof


(i) => (ii) We have to show that all trees with p vertices have p - 1 edges.
This is certainly true when p = 1. Suppose it is true for all trees with $k \ge 1$
vertices, and let T be a tree with k + 1 vertices. Then, by Theorem 3.3, T
has an end vertex w. Remove w and its incident edge from T to obtain a tree
T' with k vertices. By the induction hypothesis, T' has k - 1 edges; so T has
(k - 1) + 1 = k edges as required.


(ii) => (iii) Suppose T has p - 1 edges and no cycles, and suppose it consists
of $t \ge 1$ components, $T_1, \ldots, T_t$, each of which has no cycles and hence must
be a tree. Let $p_i$ denote the number of vertices in $T_i$. Then $\sum_i p_i = p$, and the
number of edges in T is $\sum_i (p_i - 1) = p - t$. So p - t = p - 1, i.e. t = 1, so that
T is connected.


(iii) => (i) Suppose T is connected with p - 1 edges, but is not a tree. Then T
must have a cycle. Removing an edge from a cycle does not destroy connectedness,
so we can remove edges from cycles until no cycles are left, preserving
connectedness. The resulting graph must be a tree, with p vertices and q < p - 1
edges, contradicting the fact, proved above, that every tree satisfies (ii).
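
Condition (iii) gives a convenient computational test for a tree: count the edges, then check connectedness. The Python sketch below is one way to express this; the function name `is_tree` and the vertex labels for ethane (C1, C2, H1, ..., H6) are assumptions made for the illustration, and the graph is assumed to be simple with at least one vertex.

```python
def is_tree(vertices, edges):
    """Test condition (iii) of Theorem 3.4: p - 1 edges and connected."""
    vertices = list(vertices)
    if len(edges) != len(vertices) - 1:
        return False
    adj = {v: [] for v in vertices}
    for u, v in edges:
        adj[u].append(v)
        adj[v].append(u)
    # Depth-first search from an arbitrary vertex.
    seen, stack = {vertices[0]}, [vertices[0]]
    while stack:
        x = stack.pop()
        for y in adj[x]:
            if y not in seen:
                seen.add(y)
                stack.append(y)
    return len(seen) == len(vertices)

# The ethane graph: two carbons joined to each other, each with three hydrogens.
ethane_vertices = ["C1", "C2", "H1", "H2", "H3", "H4", "H5", "H6"]
ethane_edges = [("C1", "C2"),
                ("C1", "H1"), ("C1", "H2"), ("C1", "H3"),
                ("C2", "H4"), ("C2", "H5"), ("C2", "H6")]
print(is_tree(ethane_vertices, ethane_edges))

# A cycle of length 4 has p = q = 4, so it fails the edge count.
print(is_tree([1, 2, 3, 4], [(1, 2), (2, 3), (3, 4), (4, 1)]))
```

A triangle together with an isolated vertex has p - 1 = 3 edges but is not connected, so the edge count alone is not enough; the connectivity check is what rules it out.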


This theorem can be used to establish the tree-like nature of certain chemical 
molecules. 


Example 3.5 
Show that the alkanes (paraffins) $C_nH_{2n+2}$ have tree-like molecules.


Solution


Each molecule is represented by a graph with n + (2n + 2) = 3n + 2 vertices.
Of these, n have degree 4 and 2n + 2 have degree 1, so, by Theorem 3.1,

2q = 4n + 2n + 2 = 6n + 2,

whence q = 3n + 1 = p - 1. Since molecules are connected, the graphs must be
trees, by Theorem 3.4.


The first few alkanes are shown in Figure 3.6. 


Figure 3.6 Alkanes (methane, ethane, propane, butane, isobutane)


Note that there are two “different” trees corresponding to $C_4H_{10}$.


Definition 3.6 


Two graphs $G_1$, $G_2$ are isomorphic if it is possible to label the vertices of both
graphs by the same labels, so that, for each pair u, v of labels, the number of
edges joining vertices u and v in $G_1$ is equal to the number of edges joining u
and v in $G_2$.


Example 3.6


(i) The graphs portrayed by the last two diagrams in Figure 3.4 are isomorphic.
(ii) The butane and isobutane graphs (Figure 3.6) are not isomorphic. The
second graph has one vertex of degree 4 joined to all the other vertices of
degree 4, but this does not happen in the first graph.


Tree diagrams such as those in Figure 3.6 were introduced in 1864 by the 
chemist A. Crum Brown in his study of isomerism, the occurrence of molecules 
with the same chemical formula but different chemical properties. The problem 
of enumerating the non-isomorphic molecules $C_nH_{2n+2}$ was eventually solved
by Cayley in 1875, but his solution is beyond the scope of this book. 

A related problem was: find T(n), the number of non-isomorphic trees with 
n vertices. We have T(1) = T(2) = T(3) = 1, and, as the reader should check, 
T(4) = 2, T(5) = 3, T(6) = 6. No simple formula for T(n) exists, although
T(n) is the coefficient of $x^n$ in a known but very complicated series. However,
there does exist a very nice formula for the number of trees on n given labelled
vertices. For example, although T(3) = 1, there are three labelled trees with
vertices labelled 1, 2, 3 as shown in Figure 3.7. It was established by Cayley in
1889 that the number of labelled trees on n vertices is $n^{n-2}$. A proof of this
will be given in Chapter 6.
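
For small n, Cayley's formula can be checked by brute force. The Python sketch below is an illustration under stated assumptions, not a proof: it uses Theorem 3.4(ii), so that a choice of n - 1 edges on n labelled vertices is a tree exactly when it contains no cycle. The helper names `find` and `count_labelled_trees` are invented for this example, and the method is only practical for very small n.

```python
from itertools import combinations

def find(parent, x):
    """Follow parent links to the representative of x's component."""
    while parent[x] != x:
        x = parent[x]
    return x

def count_labelled_trees(n):
    """Count the sets of n - 1 edges on vertices 0..n-1 that contain no cycle."""
    all_edges = list(combinations(range(n), 2))
    count = 0
    for chosen in combinations(all_edges, n - 1):
        parent = list(range(n))
        acyclic = True
        for u, v in chosen:
            ru, rv = find(parent, u), find(parent, v)
            if ru == rv:        # u and v are already joined: this edge closes a cycle
                acyclic = False
                break
            parent[ru] = rv     # merge the two components
        if acyclic:
            count += 1
    return count

for n in range(2, 7):
    print(n, count_labelled_trees(n), n ** (n - 2))
```

For each n from 2 to 6 the two printed counts should agree; for n = 3 both are 3, matching the three labelled trees mentioned above.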


Figure 3.7


3.4 Spanning Trees


Suppose that a connected graph represents a railway system, the vertices rep- 
resenting the towns and the edges the railtracks. Suppose also that the gov- 
ernment wishes to get rid of as much track as possible, nevertheless retaining 
a rail system which connects all the towns. What is required is a tree which is 
a subgraph of the given graph, containing all the vertices. 


Definition 3.7 


A spanning tree of a connected graph G is a tree which contains all the 
vertices of G and which is a subgraph of G. 


Example 3.7 


(i) K3 has three spanning trees, as shown in Figure 3.7. 


(ii) $K_4$ has $16 = 4^2$ spanning trees. Draw them. Do you see how this relates to
Cayley’s 1889 result?


(iii) In the graph of Figure 3.5, the edges zu, xu, uy, yv, yw form a spanning
tree.


In the case of a weighted graph G, i.e. when each edge e of G has a weight w(e)
assigned to it, where w(e) is a positive number such as the length of e, then it
may be desired to find a spanning tree of smallest possible total weight. There
are several different algorithms which find such a minimum weight spanning 
tree of G. 

The greedy algorithm


This is often known as Kruskal’s algorithm.

Procedure

(i) Choose an edge of smallest weight.


(ii) At each stage, choose from the edges not yet chosen the edge of smallest
weight whose inclusion will not create a cycle.


(iii) Continue until a spanning tree is obtained.


(If the given graph has p vertices, the algorithm will terminate after p - 1 edges
have been chosen.)


Example 3.8 


Apply the greedy algorithm to the graph of Figure 3.8. 


Figure 3.8 (a weighted graph on the vertices A, B, C, D, E with edge weights
AB 4, BC 6, CD 8, DE 6, AE 2, AD 5, BD 3)


Solution 


First choose AE (weight 2). Then choose BD(3), then AB(4). We cannot now
choose AD(5) since its inclusion would create a cycle ADBA. Similarly we
cannot choose DE. So choose BC(6). The edges AE, AB, BD, BC then form
a minimum weight spanning tree of weight 2 + 3 + 4 + 6 = 15.
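
The greedy procedure can be sketched in Python as follows. This is an illustrative implementation rather than a definitive one: the cycle test uses a simple union-find structure, which is a standard technique but is not described in the text, and the function name `kruskal` is an assumption. The weights of AE, BD, AB, AD and BC are those quoted in the solution; the weights DE = 6 and CD = 8 are read from the labels of Figure 3.8 and should be treated as assumptions.

```python
def kruskal(vertices, weighted_edges):
    """Greedy algorithm: scan edges by increasing weight, skipping any that close a cycle."""
    parent = {v: v for v in vertices}

    def find(x):
        # Representative of the component containing x.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    tree = []
    for w, u, v in sorted(weighted_edges):
        ru, rv = find(u), find(v)
        if ru != rv:            # u and v lie in different components: no cycle is created
            parent[ru] = rv
            tree.append((u, v, w))
    return tree

# Edges of Figure 3.8 as (weight, end, end); DE and CD weights are assumed.
FIG_3_8 = [(4, "A", "B"), (6, "B", "C"), (8, "C", "D"), (6, "D", "E"),
           (2, "A", "E"), (5, "A", "D"), (3, "B", "D")]

tree = kruskal("ABCDE", FIG_3_8)
print(tree)
print(sum(w for _, _, w in tree))
```

With these weights the edges are accepted in the order AE, BD, AB, BC, and AD and DE are rejected, in agreement with the solution above.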


Justification of the greedy algorithm 


Suppose that the greedy algorithm produces a tree T, but that there is another
spanning tree U which has smaller weight than T. Since $T \ne U$, and both have
the same number of edges, there must be an edge in T not in U: let e be such
an edge of minimum weight. The addition of e to U must create a cycle C,
and this cycle must contain an edge e' which is not in T. Now $w(e') \ge w(e)$,
since if w(e') < w(e) then e' would have been chosen by the greedy algorithm
rather than e. So if we remove e' from C we obtain a spanning tree V such that
$w(V) \le w(U)$, and V has one more edge in common with T than U had. By
repeating this process we eventually change U into T, one edge at a time, and
conclude that $w(T) \le w(U) < w(T)$, a contradiction. So no such U can exist.

The greedy algorithm is so called because it greedily minimises the weight at
each step, ignoring possible future complications; fortunately it gets away with 
this strategy. The disadvantage of the algorithm, however, lies in the difficulty 
of determining at each stage whether or not a cycle would be created by the 
inclusion of the smallest weighted edge available (this is particularly true when 
the graph is large). This problem can be overcome by using a slightly different 
algorithm, due to Prim (1957). In Prim’s algorithm, the graph constructed is 
connected (and hence a tree) at each stage of the construction (unlike the greedy 
algorithm, which chose BD immediately after AE in the above example), and 
at each stage the smallest weight edge is sought which joins the existing tree
to a vertex not in the tree. Clearly the inclusion of this edge cannot create a 
cycle. 


Prim’s algorithm 
(i) Select any vertex, and choose the edge of smallest weight from it. 


(ii) At each stage, choose the edge of smallest weight joining a vertex already 
included to a vertex not yet included. 


(iii) Continue until all vertices are included. 


Example 3.8 (revisited) 


Use Prim’s algorithm starting at B. Choose BD(3), then BA(4), then AE(2), 
then BC(6) to obtain the same spanning tree as before. 
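
Prim's algorithm can be sketched in the same style. The version below keeps the candidate edges in a priority queue (Python's `heapq`), which is an implementation choice made for this illustration and is not part of the description above; the function names and the DE and CD weights (read from the labels of Figure 3.8) are likewise assumptions.

```python
import heapq

def build_adj(weighted_edges):
    """Adjacency map: adj[u][v] = weight of edge uv."""
    adj = {}
    for w, u, v in weighted_edges:
        adj.setdefault(u, {})[v] = w
        adj.setdefault(v, {})[u] = w
    return adj

def prim(adj, start):
    """Grow a tree from `start`, always adding the lightest edge leaving the tree."""
    in_tree = {start}
    heap = [(w, start, v) for v, w in adj[start].items()]
    heapq.heapify(heap)
    tree = []
    while heap and len(in_tree) < len(adj):
        w, u, v = heapq.heappop(heap)
        if v in in_tree:
            continue            # both ends already in the tree: would create a cycle
        in_tree.add(v)
        tree.append((u, v, w))
        for x, wx in adj[v].items():
            if x not in in_tree:
                heapq.heappush(heap, (wx, v, x))
    return tree

# Edges of Figure 3.8 as (weight, end, end); DE and CD weights are assumed.
FIG_3_8 = [(4, "A", "B"), (6, "B", "C"), (8, "C", "D"), (6, "D", "E"),
           (2, "A", "E"), (5, "A", "D"), (3, "B", "D")]

tree = prim(build_adj(FIG_3_8), "B")
print(tree)
```

Starting at B, the edges are added in the order BD, BA, AE, BC, which is the sequence found above.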


A third algorithm operates by removing edges from the given graph, destroying
cycles, until a spanning tree is left. At each stage remove the largest-weighted
edge whose removal does not disconnect the graph. In Example 3.8,
we could remove DC, then DE, then AD. Clearly this approach would be
quicker than the others if the graph has “few” edges.
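
A minimal Python sketch of this edge-removal approach follows. The function names are assumptions for this illustration, connectivity is retested from scratch after each trial removal (simple rather than efficient), and the DE and CD weights are again read from the labels of Figure 3.8.

```python
def is_connected(vertices, edges):
    """Depth-first search test for connectedness; edges are (weight, u, v) triples."""
    vertices = list(vertices)
    adj = {v: [] for v in vertices}
    for _, u, v in edges:
        adj[u].append(v)
        adj[v].append(u)
    seen, stack = {vertices[0]}, [vertices[0]]
    while stack:
        x = stack.pop()
        for y in adj[x]:
            if y not in seen:
                seen.add(y)
                stack.append(y)
    return len(seen) == len(vertices)

def reverse_delete(vertices, weighted_edges):
    """Scan edges by decreasing weight, deleting each one whose removal keeps the graph connected."""
    kept = sorted(weighted_edges, reverse=True)
    removed = []
    for edge in list(kept):
        trial = [e for e in kept if e != edge]
        if is_connected(vertices, trial):
            kept = trial
            removed.append(edge)
    return kept, removed

# Edges of Figure 3.8 as (weight, end, end); DE and CD weights are assumed.
FIG_3_8 = [(4, "A", "B"), (6, "B", "C"), (8, "C", "D"), (6, "D", "E"),
           (2, "A", "E"), (5, "A", "D"), (3, "B", "D")]

kept, removed = reverse_delete("ABCDE", FIG_3_8)
print(removed)
print(sum(w for w, _, _ in kept))
```

The edges removed are CD, DE and AD, in that order, leaving the same spanning tree of weight 15.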


3.5 Bipartite Graphs 


Definition 3.8 


A graph is bipartite if its vertex set V can be partitioned into two sets B,W 
in such a way that every edge of the graph joins a vertex in B to a vertex in 
W. The partition V = BU W is called a bipartition of the vertex set. 




Example 3.9 


Labellings show that the graphs in Figure 3.9 are bipartite. In both graphs, 
each edge joins a B to a W.

Figure 3.9 Bipartite graphs


If we interpret B and W as black and white, we see that a graph is bipartite 
precisely when the vertices can be coloured using two colours so that no edge 
joins two vertices of the same colour. For this reason, bipartite graphs are 
sometimes called bichromatic. 


Example 3.10 


The cycle $C_n$ is bipartite if and only if n is even.


Theorem 3.5 


A connected graph is bipartite if and only if it contains no cycle of odd length. 


Proof 


If a graph G contains an odd cycle (i.e. a cycle of odd length) then it cannot 
possibly be bipartite. So suppose now that G contains no odd cycle; we shall 
show how to colour its vertices B and W. 

Choose any vertex v of G, and partition V as $B \cup W$ where

$B = \{u \in V : \text{shortest path from } v \text{ to } u \text{ has even length}\}$,
$W = \{u \in V : \text{shortest path from } v \text{ to } u \text{ has odd length}\}$.

We have $v \in B$ since 0 is even; we have to check that no edge of G has both
ends in B or both ends in W.

Suppose there is an edge xy with $x \in B$ and $y \in B$. Then, denoting the
length of the shortest path from vertex $v_1$ to vertex $v_2$ by $d(v_1, v_2)$, we have
d(v, x) = 2m and d(v, y) = 2n for some integers m, n. But there is a walk from
v to y via x of length 2m + 1, so $2n \le 2m + 1$. Similarly $2m \le 2n + 1$, so m = n.

Denote the shortest paths from v to x and y by P(x) and P(y) respectively.
Then, since m = n, P(x) and P(y) have equal lengths. Let w be the last
vertex on P(x) which is also on P(y) (possibly w = v). Since both are shortest
paths, each reaches w after d(v, w) edges, so the part of P(x)
from w to x and the part of P(y) from w to y must be of equal length, and,
since they have only w in common, they must, with edge xy, form an odd cycle.
But G has no odd cycles, so the assumption of the existence of the edge xy
must be false. So there is no edge with both ends in B; similarly there is no
edge with both ends in W.
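
The construction in this proof is effectively an algorithm: compute shortest-path distances from v by breadth-first search, split the vertices by parity, and then check every edge. The Python sketch below follows that outline; the names `bipartition` and `cycle` are assumptions for this illustration, and the graph is assumed to be connected and given as an adjacency map.

```python
from collections import deque

def bipartition(adj, v):
    """Return (B, W) as in the proof of Theorem 3.5, or None if some edge has both ends in one part."""
    dist = {v: 0}
    queue = deque([v])
    while queue:                      # breadth-first search gives shortest-path lengths
        x = queue.popleft()
        for y in adj[x]:
            if y not in dist:
                dist[y] = dist[x] + 1
                queue.append(y)
    B = {u for u, d in dist.items() if d % 2 == 0}
    W = set(dist) - B
    for x in adj:
        for y in adj[x]:
            if (x in B) == (y in B):  # edge xy has both ends in the same part
                return None
    return B, W

def cycle(n):
    """Adjacency map of the cycle C_n on vertices 0..n-1."""
    return {i: [(i - 1) % n, (i + 1) % n] for i in range(n)}

print(bipartition(cycle(6), 0))
print(bipartition(cycle(5), 0))
```

The even cycle yields a bipartition, while the odd cycle returns None, consistent with Example 3.10.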




Corollary 3.6 


All trees are bipartite. 




Definition 3.9 (Complete bipartite graphs) 


A simple bipartite graph with vertex set $V = B \cup W$ is complete if every
vertex in B is joined to every vertex in W. If |B| = m and |W| = n, the graph
is denoted by $K_{m,n}$ or by $K_{n,m}$. For example, the utilities graph of Figure 3.2
is $K_{3,3}$, and the methane graph of Figure 3.6 is $K_{1,4}$.

Clearly, $K_{m,n}$ has m + n vertices and mn edges; m of the vertices have degree
n, and n of the vertices have degree m.

The complete graphs $K_n$ and the complete bipartite graphs $K_{m,n}$ play important
roles in graph theory, particularly in the study of planarity to which
we now turn.


3.6 Planarity 


A graph is planar if it can be drawn in the plane with no edges crossing. The 
concept of planarity has already appeared in the utilities problem, which can 
be restated as: is $K_{3,3}$ planar? If a graph is planar, then any drawing of it
with no edges crossing is called a plane graph. For example, $K_4$ is planar, as
was shown in Figure 3.4; the second drawing of $K_4$ there was a plane graph,
establishing its planarity. 

Planar graphs occur naturally in the four-colour problem. In colouring a 
map, it is standard procedure to give adjacent countries different colours. It 
became apparent that four colours always seemed to be sufficient to colour any
map, and a general proof of this statement was attempted by A.B. Kempe in
1879. Ten years later, Heawood discovered that Kempe’s “proof” was flawed,
and instead of the four-colour theorem we had the four-colour conjecture. Eventually,
in 1976, the truth of the conjecture was established by two mathematicians,
K. Appel and W. Haken; as the postmark of the University of Illinois
asserted, “four colours suffice”.

The problem of colouring a map can be transformed into one of colouring the 
vertices of a planar graph. Given a map, we can represent each region by a vertex,
and join two vertices by an edge precisely when the corresponding regions
share a common boundary. For example, Figure 3.10 shows a map and a planar
graph representing it. The four-colour problem is then that of colouring the vertices
of a planar graph with four colours, so that no two adjacent vertices receive
the same colour. Colourings of graphs will be discussed further in Chapter 5.

Figure 3.10

Any plane graph clearly divides the plane into disjoint regions, one of which
is infinite. The basic result about plane graphs is known as Euler’s formula;
Euler initially studied it in the context of polyhedra, and we shall look at this
in the next section.


Theorem 3.7 (Euler's formula) 
Any connected plane (p, q)-graph divides the plane into r regions, where 


p-q+r=2. 


Proof 


If there is a cycle, remove one edge from it. The effect is to reduce q and r by 1
(since two regions are amalgamated into one), and to leave p unchanged. So the
resulting graph has p' = p, q' = q - 1, r' = r - 1, where p' - q' + r' = p - q + r.


Repeat this process until no cycles remain. The final graph must be a tree,
with p'' = p vertices, q'' = p - 1 edges and r'' = 1 region, so that
p - q + r = p'' - q'' + r'' = p - (p - 1) + 1 = 2.


Example 3.11 
The plane graph in Figure 3.10 has 


p - q + r = 5 - 8 + 5 = 2.

There are four finite regions and one infinite region.


We now define the degree of a region of a plane graph to be the number of 
encounters with edges in a walk round the boundary of the region. 


Example 3.12 
In Figure 3.11 regions 3 and 4 have degree 3, the infinite region 1 has degree
5, and region 2 has degree 9 (note that one edge is encountered twice, once on
each side).

Figure 3.11

Parallel to the handshaking lemma we have:


Theorem 3.8 


In a connected plane graph, 2q = sum of degrees of the regions.


Theorem 3.9
$K_n$ is planar only if $n \le 4$.


Proof


It is enough to show that $K_5$ is non-planar. (Why?) Now $K_5$ has p = 5, q = 10,
so if a plane drawing of $K_5$ exists it must have r = 2 - 5 + 10 = 7 regions.
Each of the seven regions must have degree $\ge 3$, so, by Theorem 3.8,
$20 = 2q \ge 7 \times 3 = 21$, a contradiction.


Theorem 3.10 


K3,3 is not planar. 


Proof 

$K_{3,3}$ has p = 6 and q = 9, so if a plane drawing exists it must have r =
2 - 6 + 9 = 5 regions. Since $K_{3,3}$ is bipartite, with no odd cycles, each region
must have degree $\ge 4$, so we must have $18 = 2q \ge 4 \times 5 = 20$, a contradiction.


3. Introduction to Graphs 57 


Corollary 3.11 
$K_{m,n}$ is planar $\iff \min(m,n) \le 2$.


The technique of counting the sum of the degrees of the regions is a useful 
one. We can apply it to the famous Petersen graph, shown in Figure 3.12. (See 
Section 4.1 and Exercise 5.17 for more about this graph.) 


Figure 3.12 The Petersen graph


Example 3.13 


The Petersen graph is not planar. 


Solution 


Suppose a plane drawing exists. Since p = 10 and q = 15, we would have
r = 2 - 10 + 15 = 7. Now the shortest cycle in the graph clearly has length
5, so every region must have degree $\ge 5$. So we would have
$30 = 2q \ge 7 \times 5 = 35$, a contradiction.
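
The counting argument used for $K_5$, $K_{3,3}$ and the Petersen graph has the same shape each time: compute r from Euler's formula, then compare 2q with (length of shortest cycle) × r. The Python sketch below packages that comparison; the function name `counting_contradiction` is an assumption for this illustration. Note that the test is one-sided: a result of True shows the graph cannot be planar, while False proves nothing.

```python
def counting_contradiction(p, q, girth):
    """Return True if a connected plane (p, q)-graph with the given shortest
    cycle length would violate Theorem 3.8, i.e. if 2q < girth * r."""
    r = 2 - p + q                 # number of regions forced by Euler's formula
    return 2 * q < girth * r

examples = {
    "K5":       (5, 10, 3),
    "K3,3":     (6, 9, 4),
    "Petersen": (10, 15, 5),
    "K4":       (4, 6, 3),
}
for name, (p, q, g) in examples.items():
    print(name, counting_contradiction(p, q, g))
```

The first three graphs give a contradiction; $K_4$ does not, which is consistent with its being planar.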
Kuratowski’s theorem 


What makes a graph non-planar? Clearly, if it contains $K_5$ or $K_{3,3}$ as a subgraph,
then it cannot possibly be planar. It was proved in 1930 by the Polish
mathematician Kuratowski that, essentially, it is only the presence of a $K_5$ or
a $K_{3,3}$ within a graph that can stop it being planar.

To clarify this statement, we first make the following observation. Since $K_5$
is not planar, the graph shown in Figure 3.13 cannot be planar either. For if it
were, we could make a plane drawing of it, erase b from the edge ac, and obtain
a plane drawing of $K_5$. Inserting a new vertex into an existing edge of a graph
is called subdividing the edge, and one or more subdivisions of edges create
a subdivision of the original graph.

Figure 3.13


Theorem 3.12 (Kuratowski’s theorem) 

A graph is planar if and only if it does not contain a subdivision of $K_5$ or $K_{3,3}$
as a subgraph. 

The proof of this deep topological result is beyond the scope of this book. But 


we exhibit the result’s usefulness by using it to prove that the Petersen graph 
is non-planar. 


Example 3.13 (again) 


In Figure 3.14, Petersen’s graph is on the left. On the right is the same graph 
with two edges removed. This subgraph is a subdivision of $K_{3,3}$ as shown by
the labelling of the vertices. 

Another test for planarity will be given in Section 4.2. 


Figure 3.14


Chords of a circle 


We close this section on planarity with an application of Euler’s formula to a
well-known problem concerning chords of a circle.

Suppose we have n points spaced round a circle, and we join each pair of
points by a chord, taking care to ensure that no three chords intersect at the
same point. Into how many regions is the interior of the circle divided? The
cases n = 3, 4, 5 are shown in Figure 3.15. It would appear that n = 6 should
give 32 regions. But it does not! (Check!)

Figure 3.15 (n = 3: 4 regions; n = 4: 8 regions; n = 5: 16 regions)

Suppose we have n points and have drawn the $\binom{n}{2}$ chords. There will be n
regions with a circular arc as a boundary; let’s lay them aside and concentrate
on the remaining regions. Turn the geometrical picture into a graph by putting
a vertex at each of the n given points, and at each crossing point of chords.
How many crossing points are there? There is one for each pair of chords which
cross. But any pair of crossing chords is obtained by choosing 4 of the given
n points and drawing the “cross” chords between them; so there must be $\binom{n}{4}$
crossing points. So the resulting graph has $p = n + \binom{n}{4}$ vertices. Each of the n
original vertices has degree n - 1, and each of the new $\binom{n}{4}$ vertices has degree
4. So by the handshaking lemma

$2q = n(n-1) + 4\binom{n}{4}$, i.e. $q = \binom{n}{2} + 2\binom{n}{4}$.

Thus

$r = 2 - p + q = 2 - n - \binom{n}{4} + \binom{n}{2} + 2\binom{n}{4}$
$= 2 - n + \binom{n}{2} + \binom{n}{4}$.

Here r includes a count of 1 for an infinite region, so there are $1 - n + \binom{n}{2} + \binom{n}{4}$
finite regions. We have to add the n boundary regions which we put aside earlier,
so finally the number of regions is

$1 + \binom{n}{2} + \binom{n}{4}$.


Check that this gives 4,8, 16 for n = 3,4,5, and 31 for n = 6. 
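
The suggested check is quick to carry out in Python. The sketch below evaluates the final formula and, separately, retraces the Euler's formula count (vertices, edges, regions) used in the argument above; the function names are assumptions for this illustration, and `math.comb` requires Python 3.8 or later.

```python
from math import comb

def chord_regions(n):
    """Number of regions given by the closed formula 1 + C(n,2) + C(n,4)."""
    return 1 + comb(n, 2) + comb(n, 4)

def chord_regions_via_euler(n):
    """Retrace the derivation: count p and q, apply Euler's formula, add the n arc regions."""
    p = n + comb(n, 4)                 # original points plus crossing points
    q = comb(n, 2) + 2 * comb(n, 4)    # from the handshaking lemma
    r = 2 - p + q                      # regions of the plane graph, including the infinite one
    return (r - 1) + n                 # finite regions plus the n boundary regions

print([chord_regions(n) for n in range(3, 7)])
print([chord_regions_via_euler(n) for n in range(3, 7)])
```

Both lists agree, and the value for n = 6 is 31 rather than the 32 that the pattern 4, 8, 16 might suggest.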
3.7 Polyhedra


A polyhedron is a solid bounded by a finite number of faces, each of which is 
polygonal. For example, the pyramid in Figure 3.16(a) is a polyhedron with 
five vertices, five faces (four triangular, and one square base), and eight edges.

Figure 3.16 A pyramid and its plane graph


As was mentioned earlier, Euler’s formula arose first in the study of polyhedra,
relating the numbers of vertices, faces and edges in a convex polyhedron. (A 
polyhedron is convex if the straight line segment joining any two of its vertices 
lies entirely within it.) Such a polyhedron can be represented by a plane graph, 
obtained by projecting the polyhedron into a plane. The graph in Figure 3.16(b) 
represents the pyramid; think of the internal vertex as the top of the pyramid, 
and think of the base of the pyramid as being represented by the infinite region 
(of degree 4). 

The cube is an example of a regular polyhedron. A polyhedron is regular 
if there exist integers $m \ge 3$, $n \ge 3$ such that each vertex has m faces (or m
edges) meeting at it, and each face has n edges on its boundary. For a cube, 
m = 3 and n = 4. Convex regular polyhedra are known as Platonic solids; 
they were discussed at great length by the ancient Greeks who knew that there 
were only 5 such solids. In the next theorem we use the terminology of graphs, 
moving from a polyhedron to its corresponding graph. 


Theorem 3.13 


Suppose that a regular polyhedron has each vertex of degree m and each face 
of degree n. Then (m,n) is one of (3, 3), (3, 4), (4, 3), (3, 5), (5, 3).
Further, there exist Platonic solids corresponding to each of these pairs. 


Proof 
We have p - q + r = 2, where

2q = sum of vertex degrees = mp

and 2q = sum of face degrees = nr.

So $\frac{2q}{m} - q + \frac{2q}{n} = 2$, whence

$(2m + 2n - mn)q = 2mn$. (3.1)

Thus, trivially, 2m + 2n - mn > 0, i.e. (m - 2)(n - 2) < 4. So (m - 2)(n - 2) = 1, 2
or 3, and the five possibilities arise.

For each possible pair (m,n), we can find g from (3.1) and then deduce the 
values of p and r. We tabulate these values in Table 3.1, and give the name of 
the corresponding Platonic solid. 


Table 3.1
m | n | q  | p  | r  | Name
3 | 3 | 6  | 4  | 4  | tetrahedron
3 | 4 | 12 | 8  | 6  | cube
4 | 3 | 12 | 6  | 8  | octahedron
3 | 5 | 30 | 20 | 12 | dodecahedron
5 | 3 | 30 | 12 | 20 | icosahedron


Note that the names reflect the number r of faces. The five solids, and their 
plane graphs, are shown in Figure 3.17. 
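
The entries of Table 3.1 can be regenerated from equation (3.1) together with 2q = mp = nr. The Python sketch below does this by a small search; the function name `platonic_parameters` is an assumption for this illustration. Searching m and n up to 6 is enough, because (m - 2)(n - 2) < 4 with m, n at least 3 forces both to be at most 5.

```python
def platonic_parameters():
    """List (m, n, q, p, r) for every pair m, n >= 3 with 2m + 2n - mn > 0."""
    rows = []
    for m in range(3, 7):
        for n in range(3, 7):
            d = 2 * m + 2 * n - m * n
            if d > 0:
                q = 2 * m * n // d          # from (3.1); the division is exact in each case
                p = 2 * q // m              # from 2q = mp
                r = 2 * q // n              # from 2q = nr
                assert p - q + r == 2       # Euler's formula as a consistency check
                rows.append((m, n, q, p, r))
    return rows

for row in platonic_parameters():
    print(row)
```

Exactly five rows are produced, one for each Platonic solid.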

As well as the five regular polyhedra just discussed, there exist the semireg- 
ular polyhedra known as the Archimedean solids. Although they may well 
have been known to the Greeks, the first known listing of them is due to Kepler
in 1619. These solids have more than one type of face, but they have the
property that each vertex has the same pattern of faces around it. For example,
the truncated cube, obtained by slicing off each of the eight vertices, has eight
triangular faces and six octagonal faces, and, at each vertex, two octagons and
one triangle meet.


Example 3.14 


A polyhedron is made up of pentagons and hexagons, with three faces meeting 
at each vertex. Show that there must be exactly 12 pentagonal faces. 


Solution 


We have p - q + r = 2 and 2q = sum of vertex degrees = 3p. Substituting
p = 2q/3 into p - q + r = 2 gives 2q = 6r - 12.
Now suppose there are x pentagonal and y hexagonal faces. Then r = x + y
and 2q = sum of degrees of faces = 5x + 6y. Substituting into 2q = 6r - 12
gives 5x + 6y = 6x + 6y - 12, whence x = 12.


The case x = 12, y = 0 corresponds of course to a dodecahedron. The case
x = 12, y = 20 corresponds to the pattern often seen on a soccer ball. The
corresponding Archimedean solid is a truncated icosahedron; the reader
should be able to see how to obtain one by slicing vertices off an icosahedron.
This solid aroused great interest in the 1990s when it was discovered that a
third form of carbon existed (as well as diamond and graphite).

Figure 3.17 (tetrahedron, cube, octahedron, dodecahedron, icosahedron)

This form is denoted by $C_{60}$; the molecular structure is that of 60 carbon
atoms situated at the vertices of a truncated icosahedron. The discoverers of
this molecule called it Buckminsterfullerene (it is commonly known as a
Buckyball) since they considered it as similar to a geodesic dome created by
the architect R. Buckminster Fuller. But, as we have pointed out, it has been
known to mathematicians for a long time.

The graphite form of carbon has the carbon atoms arranged in a flat hon- 
eycomb pattern of hexagons. Hexagons tile the plane, so need the addition of 
n-gons with n < 6 to enable a 3-dimensional form to take place. It turns out 
that 12 pentagons are just right to enable a complete closing up to take place. 
See Exercise 3.14 for the corresponding problem when pentagons are replaced 
by squares. 

There are other fullerene molecules, such as $C_{70}$ which has 12 pentagons and
25 hexagons; its shape is more like a rugby ball.
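
The counts for these molecules follow from the same relations as in Example 3.14: with three faces at each vertex, 3p = 2q = 5x + 6y and r = x + y. The Python sketch below computes p, q and r from the number of hexagons, taking x = 12 pentagons; the function name `fullerene_counts` is an assumption for this illustration, and the sketch only checks the arithmetic, not whether a polyhedron with these counts actually exists.

```python
def fullerene_counts(hexagons, pentagons=12):
    """Return (p, q, r) for a polyhedron of pentagons and hexagons with three faces per vertex."""
    r = pentagons + hexagons
    two_q = 5 * pentagons + 6 * hexagons   # sum of the face degrees
    q = two_q // 2
    p = two_q // 3                         # since 3p = 2q
    assert p - q + r == 2                  # Euler's formula holds when pentagons == 12
    return p, q, r

for y in (0, 20, 25):
    print(y, fullerene_counts(y))
```

With 0, 20 and 25 hexagons the vertex counts are 20, 60 and 70, corresponding to the dodecahedron and to the 60-atom and 70-atom molecules described above.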


Exercises 


Exercise 3.1 


Prove that the number of vertices of odd degree in a given graph is even. 


Exercise 3.2 


Show that all alcohols $C_nH_{2n+1}OH$ have tree-like molecules. (The valencies
of C, O, H are 4, 2, 1 respectively.)


Exercise 3.3


Show that if G is a simple graph with p vertices, where each vertex has
degree $\ge \frac{1}{2}(p-1)$, then G must be connected. (Hint: how many vertices
must each component have?)


Exercise 3.4 


How many spanning trees do the graphs in Figure 3.18 have? 


(a) (b) 


Figure 3.18


Exercise 3.5


How many edges must be removed from a connected (p, q)-graph to ob- 
tain a spanning tree? 


Exercise 3.6 


Let $K_{2,3}$ have bipartition $B \cup W$ where B = {a, b}, W = {$x_1, x_2, x_3$}.

(a) Explain why, in a spanning tree of $K_{2,3}$, there must be precisely one
of the vertices $x_i$ joined to both a and b.

(b) How many spanning trees does $K_{2,3}$ have?

(c) How many spanning trees does $K_{2,199}$ have?


Exercise 3.7 


Use (a) the greedy algorithm, (b) Prim’s algorithm to find a minimum 
weight spanning tree in the graph shown in Figure 3.19. 


Figure 3.19 


Exercise 3.8 


The distances between 5 Lanarkshire towns are given in Table 3.2. Find 
the shortest length of a connecting road network. 


Table 3.2 
G 
Glasgow 
Hamilton 
Airdrie 
Motherwell 


East Kilbride


Exercise 3.9


Pan Caledonian Airways (PCA) operates between 12 towns whose coor- 
dinates referred to a certain grid are (0, 2), (0, 5), (1,0), (1, 4), (2, 3), (2, 4), 
(3, 2), (3, 5), (4, 4), (4,5), (5, 3), (6,1). What is the minimum number of 
flights necessary so that travel by PCA is possible between any two of 
the towns? Find the minimum total length of such a network of flights. 


Exercise 3.10 
Determine which of the graphs in Figure 3.20 are planar. 




Figure 3.20 


Exercise 3.11 


A complete matching of a graph with 2n vertices is a subgraph con- 
sisting of n disjoint edges. How many different complete matchings are 
there in the graph of Figure 3.20(a)? 


Exercise 3.12 
The graph $G_n$ ($n \ge 1$) is shown in Figure 3.21.
(a) Is $G_n$ (i) bipartite? (ii) planar?


(b) Let $a_n$ denote the number of complete matchings of $G_n$. Show that
$a_1 = 3$ and $a_2 = 5$. Show that $a_n = a_{n-1} + 2a_{n-2}$ ($n \ge 3$) and hence
obtain a formula for $a_n$.


Exercise 3.13 


(a) Show that if G is a simple planar (p,q) graph, $p \ge 3$, then $q \le 3p - 6$.
Deduce that $K_5$ is not planar.
(b) Show that if G is a simple planar (p,q) graph, $p \ge g$, where g
is the girth of G, i.e. the length of the shortest cycle in G, then
$q \le \frac{g}{g-2}(p - 2)$.
(c) Deduce from (b) that $K_{3,3}$ and the Petersen graph are both nonplanar.


Exercise 3.14 


A convex polyhedron has only square and hexagonal faces. Three faces
meet at each vertex. Use Euler’s formula to show that there must be exactly
six square faces. The cube has no hexagonal faces: give an example
with six square faces and at least one hexagonal face. (Try truncating
an octahedron.)


Exercise 3.15 


Suppose n cuts are made across a pizza. Let $p_n$ denote the maximum
number of pieces which can result (this happens when no two cuts are
parallel or meet outside the pizza, and no three are concurrent).


Prove that $p_n = \binom{n}{0} + \binom{n}{1} + \binom{n}{2}$.


Exercise 3.16 


Let $h_n$ denote the number of spanning trees in the fan graph shown in
Figure 3.22. Verify that $h_1 = 1$, $h_2 = 3$, $h_3 = 8$.

Figure 3.22

Find a recurrence relation for $h_n$ and hence show ...


Travelling Round a Graph


In this chapter we consider various problems relating to the existence of certain 
types of walk in a graph. The reader should recall the definitions of walk, path, 
cycle and trail given in Section 3.2. The Königsberg bridge problem concerns
the existence of a trail which is closed and contains all the edges of the graph. 
We study such (Eulerian) trails in more detail, but first we look at a related type 
of problem associated with the name of the Irish mathematician Sir William 
Rowan Hamilton (1805-1865). 


4.1 Hamiltonian Graphs


The dodecahedron is shown at the end of Chapter 3. Hamilton posed the problem:
is it possible to start at one of the 20 vertices, and, by following edges, visit
every other vertex exactly once before returning to the starting point? In other 
words: is there a cycle through all the vertices? You should have no problem 
finding such a cycle (turn to Figure 4.3 if you get stuck), so it is perhaps not 
surprising that the commercial exploitation of this problem as a game was not 
a financial success. 


Definition 4.1 


A hamiltonian cycle in a graph G is a cycle containing all the vertices of G. 
A hamiltonian graph is a graph containing a hamiltonian cycle. 

The name hamiltonian is, as often happens in mathematics, not entirely just,
since others such as Kirkman had studied the idea before Hamilton.


Example 4.1


(a) The octahedral graph is hamiltonian: in Figure 4.1(a) take the hamiltonian 
cycle 1234561. 


(b) The graph of Figure 4.1(b) is not hamiltonian. The easiest way to see this 
is to note that it has 9 vertices so that, if it is hamiltonian, it must contain a 
cycle of length 9. But, being a bipartite graph, it contains only cycles of even 
length. 




Figure 4.1 


Theorem 4.1 


A bipartite graph with an odd number of vertices cannot be hamiltonian. 


Example 4.2 
(a) $K_n$ is hamiltonian for all $n \ge 3$.
(b) $K_{m,n}$ is hamiltonian if and only if $m = n \ge 2$.
(See Exercise 4.1.)


Example 4.3 


The Petersen graph is not hamiltonian. 


Figure 4.2


Solution


Label the vertices as shown in Figure 4.2, and suppose there is a hamiltonian
cycle. Every time the cycle goes from the outside along one of the “spokes”
1a, 2b, 3c, 4d, 5e, it has to return to the outside along another spoke. So the
hamiltonian cycle must contain either 2 or 4 spokes.


(a) Suppose there are 4 spokes in a hamiltonian cycle: we can assume 5e is the
one spoke not in it. Then 51 and 54 must be in the cycle, as must eb and
ec. Since 1a and 15 are in the cycle, 12 is not, so 23 is. But this gives the
cycle 23ceb2 as part of the hamiltonian cycle, which is clearly impossible.


(b) Suppose there are just two spokes in the hamiltonian cycle. Take 1a as one
of them. Then ac or ad is in the cycle, say ad. Then ac is not, so c3 is. So
spokes b2, d4, e5 are not in the cycle. Since b2 is not in the cycle, 23 must
be. Similarly, since d4 is not in, 34 must be in the cycle. So all three edges
from 3 are in the cycle, a contradiction.
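
The case analysis above can also be checked by machine. The following is a minimal sketch in Python, assuming the labelling used in the solution (outer cycle 1, ..., 5, spokes 1a, ..., 5e, and inner edges ac, ce, eb, bd, da); the function name is_hamiltonian is ours, and the search is a plain exhaustive one, practical only for very small graphs.

```python
from itertools import permutations

# Petersen graph, labelled as in the solution: outer cycle 1..5,
# spokes 1a, 2b, 3c, 4d, 5e, inner pentagram a-c, c-e, e-b, b-d, d-a.
EDGES = ["12", "23", "34", "45", "51",
         "1a", "2b", "3c", "4d", "5e",
         "ac", "ce", "eb", "bd", "da"]
ADJ = {frozenset(e) for e in EDGES}

def is_hamiltonian(vertices, adj):
    """Brute force: fix the first vertex and try every ordering of the rest."""
    first, rest = vertices[0], vertices[1:]
    for order in permutations(rest):
        cycle = (first,) + order + (first,)
        # Accept the ordering only if every consecutive pair is an edge.
        if all(frozenset(pair) in adj for pair in zip(cycle, cycle[1:])):
            return True
    return False

petersen_hamiltonian = is_hamiltonian(list("12345abcde"), ADJ)
print(petersen_hamiltonian)  # prints False
```

The search examines all 9! orderings of the remaining vertices, so it confirms the result of Example 4.3 but gives none of the insight of the argument by spokes.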


There is no straightforward way of characterising hamiltonian graphs. Perhaps
the best known simple sufficient condition is that given by Dirac in the following
theorem, but it must be emphasised that the condition given is not at all
necessary (as can be seen by considering the cycle $C_n$, $n \ge 5$).


Theorem 4.2 (Dirac, 1950) 


If G is a simple graph with p vertices, each vertex having degree $\ge \frac{1}{2}p$, then G
is hamiltonian.


Proof 


Outlined in Exercise 4.6. 


4.2 Planarity and Hamiltonian Graphs 


There are some interesting connections between planar graphs and hamiltonian 
graphs. The first arose in connection with the Four Colour Conjecture (FCC),
when it was realised that the presence of a hamiltonian cycle in a plane graph 
makes the colouring of its regions (faces) with four colours very easy. For exam- 
ple, consider the problem of colouring the faces of a dodecahedron using four 
colours. Figure 4.3 shows a hamiltonian cycle which divides the regions into an 
internal chain of regions, and an external chain. Colour the internal chain with 
colours A and B, and the external chain with colours C and D. 

Figure 4.3 Dodecahedron


Early on in the history of the FCC, Tait conjectured that every polyhedral
map in which every vertex has degree 3 has a hamiltonian cycle. (A map is
polyhedral if any two adjacent regions meet in a single common edge or a
single point.) The truth of Tait’s conjecture would have implied that every
such map is 4-colourable; however, the conjecture was finally proved false in
1946, when Tutte constructed a counterexample.

Another connection between hamiltonicity and planarity occurs in the following
algorithm, which can be used to determine whether or not a given hamiltonian
graph is planar. The basic idea is that if a graph G is both hamiltonian
and planar, then, in a plane drawing of G, the edges of G which are not in the
hamiltonian cycle H will fall into two sets, those drawn inside H and those
drawn outside.


The planarity algorithm for hamiltonian graphs 


1. Draw the graph G with a hamiltonian cycle H on the outside, i.e. with H
as the boundary of the infinite region.


2. List the edges of G not in H: $e_1, \ldots, e_r$.


3. Form a new graph K in which the vertices are labelled $e_1, \ldots, e_r$ and where
the vertices labelled $e_i$, $e_j$ are joined by an edge if and only if $e_i$, $e_j$ cross
in the drawing of G, i.e. cannot both be drawn inside (or outside) H (such
edges are said to be incompatible).


4. Then G is planar if and only if K is bipartite.


(If K is bipartite, with bipartition $B \cup W$, then the edges $e_i$ coloured B can be
drawn inside H, and the edges coloured W can be drawn outside.)
In practice, we introduce the edges one by one, as follows. 


Example 4.4 
Test the graph shown in Figure 4.4 for planarity. 


Solution 
1. The graph is already drawn with hamiltonian cycle abcdefa on the outside
Now consider bf. It crosses only ad. Next consid


get: 
bf 
df ce 


Now consider ce. It also crosses df, so we get: 


of 
df ce 


4. By now we have the full graph K. (Check: the
the number of crossings of edges in G.) Since K 
that G is planar, and we can draw it with ad an 
outside (Figure 4.5). 


Example 4.5 


Show that K3,3 is not planar.


Figure 4.6


Solution 




1. In Figure 4.6 we have K3,3 drawn with hamiltonian cycle on the outside. 


2. Edges not in hamiltonian cycle are ad, be, cf. 


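3. Obtain the graph K with vertices ad, be, cf, in which each pair of vertices
is joined by an edge (a triangle), since each pair of these edges crosses.


4. This is not bipartite, so K3,3 is not planar.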
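
The four steps of the planarity algorithm can be sketched in code. The following Python fragment is an illustration only: it assumes that the hamiltonian cycle is given as a string of vertex labels in cyclic order and that the remaining edges are given as two-letter strings, and the names crossing_graph and two_colour are ours. Two edges not in H are taken to cross exactly when their endpoints alternate round H.

```python
def crossing_graph(cycle, chords):
    """Step 3: build K, whose vertices are the chords, joined when they cross."""
    pos = {v: i for i, v in enumerate(cycle)}

    def cross(e, f):
        a, b = sorted(pos[v] for v in e)
        c, d = sorted(pos[v] for v in f)
        # Two chords cross exactly when their endpoints interleave round H.
        return a < c < b < d or c < a < d < b

    return {e: [f for f in chords if f != e and cross(e, f)] for e in chords}

def two_colour(k):
    """Step 4: return a B/W colouring of K if K is bipartite, else None."""
    colour = {}
    for start in k:
        if start in colour:
            continue
        colour[start] = "B"
        stack = [start]
        while stack:
            e = stack.pop()
            for f in k[e]:
                if f not in colour:
                    colour[f] = "W" if colour[e] == "B" else "B"
                    stack.append(f)
                elif colour[f] == colour[e]:
                    return None  # an odd cycle in K: G is not planar
    return colour

# K3,3 drawn with hamiltonian cycle abcdefa, as in Example 4.5.
k33 = crossing_graph("abcdef", ["ad", "be", "cf"])
print(two_colour(k33))  # prints None
```

When a colouring is returned, the edges coloured B can be drawn inside H and those coloured W outside, as in the remark following step 4.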


4.3 The Travelling Salesman Problem 


A sales representative of a publisher of mathematical texts has to make a round
trip, starting at home, and visiting a number of university bookshops before
returning home. How does the salesman choose his route to minimise the total
distance travelled?

Here we consider a weighted graph, in which the vertices represent the bookshops
and his home, and the edges represent the routes between them, each
edge being labelled by the length of the route it represents. The salesman wishes
to find a hamiltonian cycle of minimum length, i.e. of minimum total weight.

A complete graph $K_n$ has $(n-1)!$ different hamiltonian cycles (or $\frac{1}{2}(n-1)!$
if we do not distinguish between a cycle and its “reverse”), so finding the one
of minimum weight by looking at each in turn is out of the question when n is
large. Even for n = 10, $\frac{1}{2}(n-1)! = 181\,440$. There is no really efficient algorithm
yet known for solving the travelling salesman problem (TSP), so “good” rather
than “best” routes are sought, as are estimates, rather than exact values, of
the shortest total length.


Lower bounds


Lower bounds can be found by using spanning trees. First observe that if we
take any hamiltonian cycle and remove one edge then we get a spanning tree,
so


Solution to TSP $\ge$ minimum length of a spanning tree (MST). (4.1)


But we can do better. Consider any vertex v in the graph G. Any hamiltonian
cycle in G has to consist of two edges from v, say vu and vw, and a path from
u to w in the graph $G - \{v\}$ obtained from G by removing v and its incident
edges. Since this path is a spanning tree of $G - \{v\}$, we have


Solution to TSP $\ge$ (sum of lengths of two shortest edges from v) + (MST of $G - \{v\}$). (4.2)


Example 4.6 
Apply (4.2) to the graph of Figure 4.7. 


Solution 


Choose vertex a. The two shortest edges from a have lengths 3 and 6. The
minimum weight spanning tree of $G - \{a\}$ consists of edges bc, cd and ec, and
has length 14. So, by (4.2), a lower bound for the TSP is 3 + 6 + 14 = 23.

Instead, we could have started with b. The two shortest edges from b have
lengths 4 and 7, and the minimum weight spanning tree of $G - \{b\}$ has length
13, so we obtain the lower bound 4 + 7 + 13 = 24. This second bound gives us
more information than the first.
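
The bound (4.2) is easily computed for each choice of vertex in turn. Here is a minimal sketch in Python. Since the weights of Figure 4.7 are not reproduced here, the edge lengths in LENGTHS are hypothetical, chosen purely for illustration, and the names mst_weight and lower_bound are ours; the spanning tree is found by Prim's method of repeatedly adding the shortest edge leaving the tree.

```python
# Hypothetical edge lengths on the complete graph with vertices a, ..., e.
LENGTHS = {"ab": 5, "ac": 8, "ad": 6, "ae": 9, "bc": 6,
           "bd": 9, "be": 7, "cd": 5, "ce": 8, "de": 10}
W = {frozenset(e): length for e, length in LENGTHS.items()}

def mst_weight(vertices, w):
    """Prim's algorithm; w maps frozenset({u, v}) to the edge length."""
    vertices = list(vertices)
    in_tree, total = {vertices[0]}, 0
    while len(in_tree) < len(vertices):
        # Shortest edge joining the tree to a vertex not yet in it.
        length, v = min((w[frozenset((u, x))], x)
                        for u in in_tree for x in vertices if x not in in_tree)
        in_tree.add(v)
        total += length
    return total

def lower_bound(vertices, w, v):
    """Bound (4.2): two shortest edges at v plus the MST of G - {v}."""
    others = [u for u in vertices if u != v]
    two_shortest = sorted(w[frozenset((v, u))] for u in others)[:2]
    return sum(two_shortest) + mst_weight(others, w)

bounds = {v: lower_bound("abcde", W, v) for v in "abcde"}
print(bounds)
```

As in Example 4.6, different vertices generally give different bounds, and the largest of them is the one worth keeping.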


Upper bounds 


Assume that the weights are distances, satisfying the triangle inequality

$d(x, z) \le d(x, y) + d(y, z)$


where d(x, y) denotes the shortest distance along edges from x to y. In this
case the following method gives upper bounds for the TSP in $K_n$.

Find a minimum spanning tree of $K_n$, say of weight w. We can then find a
walk of length 2w which visits every vertex at least once, and which returns to 
its starting point, by going “round” the tree as shown in Figure 4.8. 




Figure 4.8 


We now try to reduce the length of this walk by taking shortcuts. Start at one 
vertex and follow the walk round. When we reach an edge which will take us 
to a vertex already visited, take the direct route to the next vertex not yet 
visited. For example, in Figure 4.8, which shows the minimum spanning tree 
of the graph of Example 4.6, we could start at a and obtain aecbda, which has 
length 26. 

Since this method yields a hamiltonian cycle of length no greater than twice
MST, we have


MST $\le$ solution to TSP $\le$ 2 MST, (4.3)


and, since MST $\le$ solution to TSP, by (4.1), we have constructed a hamiltonian
cycle of length at most twice the minimum possible length. In Section 4.5 we
shall improve this to at most $\frac{3}{2}$ times the minimum.
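
Going round the tree and taking shortcuts amounts to listing the vertices in the order in which a depth-first walk of the tree first meets them. The following Python sketch illustrates this under stated assumptions: the edge lengths in LENGTHS are hypothetical (they all lie between 5 and 10, so the triangle inequality holds automatically), and the names mst_edges, shortcut_tour and length are ours.

```python
# Hypothetical edge lengths on the complete graph with vertices a, ..., e.
LENGTHS = {"ab": 5, "ac": 8, "ad": 6, "ae": 9, "bc": 6,
           "bd": 9, "be": 7, "cd": 5, "ce": 8, "de": 10}
W = {frozenset(e): length for e, length in LENGTHS.items()}

def mst_edges(vertices, w):
    """Prim's algorithm, returning the tree as an adjacency dictionary."""
    tree = {v: [] for v in vertices}
    in_tree = {vertices[0]}
    while len(in_tree) < len(vertices):
        _, u, v = min((w[frozenset((u, x))], u, x)
                      for u in in_tree for x in vertices if x not in in_tree)
        tree[u].append(v)
        tree[v].append(u)
        in_tree.add(v)
    return tree

def shortcut_tour(vertices, w):
    """Walk round the tree depth-first, skipping vertices already visited."""
    tree = mst_edges(vertices, w)
    tour, stack, seen = [], [vertices[0]], set()
    while stack:
        v = stack.pop()
        if v not in seen:
            seen.add(v)
            tour.append(v)
            stack.extend(reversed(tree[v]))
    tour.append(vertices[0])  # return to the starting point
    return tour

def length(tour, w):
    """Total length of a closed tour given as a list of vertices."""
    return sum(w[frozenset(pair)] for pair in zip(tour, tour[1:]))

tour = shortcut_tour(list("abcde"), W)
print(tour, length(tour, W))
```

For these lengths the minimum spanning tree has weight 23, so by (4.3) the tour printed has length at most 46.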


4.4 Gray Codes 


A Gray code of order n is a cyclic arrangement of the $2^n$ binary sequences of
length n such that any pair of adjacent sequences differ in only one place. For
example, Figure 4.9(a) shows a Gray code of order 3.


Figure 4.9


The industrial use of Gray codes is on account of their ability to describe 
the angular position of a rotating wheel. As in Figure 4.9(b), 0 and 1 are 
represented by white and black (off and on), and are read by electrical contact 
brushes. The fact that adjacent sequences differ in only one place reduces errors 
when the contact brushes are close to a boundary between segments. (Compare 
with 1999 changing to 2000 in a car milometer.) 

Note that the code above corresponds to a hamiltonian cycle in a 3-dimensional
cube (follow the arrows in Figure 4.10). Note also that the cycle involves going
round the bottom of the cube (i.e. round a 2-dimensional cube!) with third
coordinate 0, then moving up to change the third coordinate to 1, and then
tracing out the 2-dimensional cube at the top, in the opposite direction.


Figure 4.10


This idea generalises. So to obtain a Gray code of order 4, write down a Gray code
of order 3 with 0 appended to each binary word, then follow it with the same
Gray code of order 3, in reverse order, with 1 at the end of each word. This
gives


0000 - 0100 - 1100 - 1000 - 1010 - 1110 - 0110 - 0010 - 0011 -


0111 - 1111 - 1011 - 1001 - 1101 - 0101 - 0001 - 0000.
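
The step from order 3 to order 4 is mechanical, and can be sketched in a few lines of Python. The names extend and is_gray_code are ours; the starting list order3 is the order 3 code obtained by deleting the final 0 from the first eight words above.

```python
def extend(code):
    """Append 0 to each word, then append 1 to the words in reverse order."""
    return [w + "0" for w in code] + [w + "1" for w in reversed(code)]

def is_gray_code(code):
    """Check that cyclically adjacent words differ in exactly one place."""
    return all(sum(a != b for a, b in zip(u, v)) == 1
               for u, v in zip(code, code[1:] + code[:1]))

order3 = ["000", "010", "110", "100", "101", "111", "011", "001"]
order4 = extend(order3)
print(" - ".join(order4))
print(is_gray_code(order4))  # prints True
```

Applying extend again to order4 gives a Gray code of order 5, and so on.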


4.5 Eulerian Graphs 


The driver of a snow plough wishes to set out from the depot, travel along each
road exactly once, and return to the depot. When is this possible? Similarly,
the citizens of Königsberg wished to cross every bridge exactly once and return
home. Both problems ask for a closed trail of a particular type.


Definition 4.2


An eulerian circuit is a closed trail which contains each edge of the graph. A
graph which contains an eulerian circuit is called an eulerian graph.


It was observed in Section 3.1 that a necessary condition for the existence of an 
eulerian circuit is that all vertex degrees must be even. It turns out that this 
condition is also sufficient in connected graphs. Our proof will use the following 
lemma. 


Lemma 4.3 


Let G be a graph in which every vertex has even degree. Then the edge set of 
G is an edge-disjoint union of cycles. 


Proof 

Proceed by induction on q, the number of edges. The lemma is true for q = 2,
so consider a graph G with k edges and suppose that the lemma is true for all
graphs with q < k. Take any vertex $v_0$, and start a walk from $v_0$, continuing
until a vertex already visited is visited for the second time. If this vertex is $v_j$,
then the part of the walk from $v_j$ to $v_j$ is a cycle C. Remove C to obtain a graph
H with fewer than k edges and in which every vertex has even degree. By induction, H
is an edge-disjoint union of cycles, so the result follows.


Theorem 4.4 


Let G be a connected graph. Then G is eulerian if and only if every vertex has 
even degree. 


Proof

⇒. Already shown.

⇐. Suppose every vertex has even degree. Then the edges fall into disjoint
cycles. Take any such cycle $C_1$. If $C_1$ does not contain all the edges of G then,
since G is connected, there must be a vertex $v_1 \in C_1$ and an edge $v_1v_2$ not in
$C_1$. Now $v_1v_2$ is in some cycle, say $C_2$, disjoint from $C_1$. Insert $C_2$ into $C_1$ at
$v_1$ to obtain a closed trail. If this trail does not contain all edges of G, take a
vertex $v_3$ in $C_1 \cup C_2$ and edge $v_3v_4$ not in $C_1 \cup C_2$. Then $v_3v_4$ is in some cycle
$C_3$ which we insert into $C_1 \cup C_2$. Continue in this way until all edges are used
up.


Example 4.7 


Figure 4.11 


In Figure 4.11, first take cycle abcdefa. Then insert cycle agea at a, and finally
insert cycle bdhb at b to obtain the eulerian circuit


ageabdhbcdefa.
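

The inserting of closed trails used in the proof of Theorem 4.4 translates directly into a procedure. The Python sketch below is one way of doing it, assuming a connected graph in which every vertex has even degree; the name eulerian_circuit is ours, and the edge list is read off from the three cycles abcdefa, agea and bdhb named in Example 4.7. The circuit it finds need not be the one given in the example.

```python
def eulerian_circuit(edges, start):
    """Repeatedly splice in a closed trail at a vertex with unused edges."""
    unused = {}
    for u, v in edges:
        unused.setdefault(u, []).append(v)
        unused.setdefault(v, []).append(u)
    circuit = [start]
    i = 0
    while i < len(circuit):
        v = circuit[i]
        if unused[v]:
            # Trace a closed trail from v along unused edges; since all
            # degrees are even, it can only get stuck back at v.
            trail, u = [v], v
            while unused[u]:
                x = unused[u].pop()
                unused[x].remove(u)
                trail.append(x)
                u = x
            circuit[i:i + 1] = trail  # insert it at this occurrence of v
        else:
            i += 1
    return circuit

EDGES = [tuple(e) for e in "ab bc cd de ef fa ag ge ea bd dh hb".split()]
circuit = eulerian_circuit(EDGES, "a")
print("".join(circuit))
```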


Definition 4.3


An eulerian trail is a trail which contains every edge of the graph, but is
not closed. A non-eulerian graph which contains an eulerian trail is called a
semi-eulerian graph.


The following result follows immediately from Theorem 4.4. 


Theorem 4.5 


A connected graph G is semi-eulerian if and only if it contains precisely two 
vertices of odd degree. 


Example 4.8 


In the Königsberg bridge problem, suppose that one further bridge is built.
The resulting graph will then have two vertices of odd degree and hence will
contain an eulerian trail.


An upper bound for the TSP


The following method yields a hamiltonian cycle in a complete graph whose
length is at most $\frac{3}{2}$ times the length of the minimum hamiltonian cycle. This
improves the bound in Section 4.3.

Given $K_n$, labelled by the length of the edges, first find a minimum spanning
tree T. As in any graph (see Chapter 3), T has an even number 2m of vertices of odd
degree. These 2m vertices can be joined in pairs by m edges of $K_n$, no two of
which share a vertex. Such a set of disjoint edges is called a matching. There will
be many ways of choosing such a matching, so we choose a matching M of
smallest total length. If we now add the edges of M to T, we obtain the new
graph $M \cup T$ in which every vertex has even degree: thus $M \cup T$ possesses an
eulerian circuit.

For example, with the graph of Example 4.6, T has length 17 (as in Figure
4.8) and T has four vertices of odd degree. Take M = {ad, bc} to obtain $M \cup T$
as shown in Figure 4.12. An eulerian circuit is aecbeda.




Figure 4.12 


Starting at a, we can take aecb and, to avoid visiting e twice, go directly from
b to d, and then to a, obtaining the hamiltonian cycle aecbda which has length
26.

We now show that the eulerian circuit obtained by this method always has
length $\le \frac{3}{2}$TSP. Let TSP, EC, MST, M denote respectively the lengths of the
minimum hamiltonian cycle, the eulerian circuit, the minimum spanning tree
and the matching. Then


EC = MST + M,    TSP $\ge$ MST.


The 2m vertices of M will occur, in some order, say $x_1, \ldots, x_{2m}$, in the minimum
length hamiltonian cycle. If for each i < 2m, we replace the part of the
cycle between $x_i$ and $x_{i+1}$ by the edge $x_ix_{i+1}$, and we replace the part between
$x_{2m}$ and $x_1$ by the edge $x_{2m}x_1$, we obtain


$\ell(x_1, x_2) + \ell(x_2, x_3) + \cdots + \ell(x_{2m}, x_1) \le$ TSP

where $\ell(x_i, x_{i+1})$ denotes the length of the edge $x_ix_{i+1}$. Thus we have


$(\ell(x_1, x_2) + \ell(x_3, x_4) + \cdots + \ell(x_{2m-1}, x_{2m})) + (\ell(x_2, x_3) + \cdots + \ell(x_{2m}, x_1)) \le$ TSP.

So we obtain two matchings of $x_1, \ldots, x_{2m}$ whose lengths sum to $\le$ TSP. One
of these matchings must have total length $\le \frac{1}{2}$TSP, so that


M $\le \frac{1}{2}$TSP.


Thus EC = MST + M $\le$ TSP + $\frac{1}{2}$TSP = $\frac{3}{2}$TSP. Thus, on using shortcuts in
the eulerian circuit to avoid repeating vertices, we obtain a hamiltonian cycle
whose length is $\le \frac{3}{2}$TSP.
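
The whole method (spanning tree, matching of the odd vertices, eulerian circuit, shortcuts) can be put together as follows. This Python sketch is an illustration under stated assumptions: the edge lengths in LENGTHS are hypothetical and satisfy the triangle inequality, the function names are ours, and the smallest matching is found by trying every pairing, which is reasonable only when there are few odd vertices.

```python
# Hypothetical edge lengths on the complete graph with vertices a, ..., e.
LENGTHS = {"ab": 5, "ac": 8, "ad": 6, "ae": 9, "bc": 6,
           "bd": 9, "be": 7, "cd": 5, "ce": 8, "de": 10}
W = {frozenset(e): length for e, length in LENGTHS.items()}

def mst(vertices, w):
    """Prim's algorithm; returns the tree as a list of edges (u, v)."""
    in_tree, edges = {vertices[0]}, []
    while len(in_tree) < len(vertices):
        _, u, v = min((w[frozenset((u, x))], u, x)
                      for u in in_tree for x in vertices if x not in in_tree)
        in_tree.add(v)
        edges.append((u, v))
    return edges

def min_matching(odd, w):
    """Cheapest pairing of the odd vertices, by exhaustive search."""
    if not odd:
        return 0, []
    first, rest = odd[0], odd[1:]
    best = None
    for partner in rest:
        remaining = [v for v in rest if v != partner]
        cost, pairs = min_matching(remaining, w)
        cost += w[frozenset((first, partner))]
        if best is None or cost < best[0]:
            best = (cost, pairs + [(first, partner)])
    return best

def euler_circuit(edges, start):
    """Eulerian circuit of a multigraph given as a list of edges."""
    adj = {}
    for i, (u, v) in enumerate(edges):
        adj.setdefault(u, []).append((v, i))
        adj.setdefault(v, []).append((u, i))
    used, stack, circuit = set(), [start], []
    while stack:
        v = stack[-1]
        while adj[v] and adj[v][-1][1] in used:
            adj[v].pop()  # discard edges already traversed from the other end
        if adj[v]:
            u, i = adj[v].pop()
            used.add(i)
            stack.append(u)
        else:
            circuit.append(stack.pop())
    return circuit

def tsp_upper_bound(vertices, w):
    tree = mst(vertices, w)
    degree = {v: 0 for v in vertices}
    for u, v in tree:
        degree[u] += 1
        degree[v] += 1
    odd = [v for v in vertices if degree[v] % 2 == 1]
    _, matching = min_matching(odd, w)
    circuit = euler_circuit(tree + matching, vertices[0])
    # Take shortcuts: keep only the first visit to each vertex.
    tour = list(dict.fromkeys(circuit)) + [vertices[0]]
    return tour, sum(w[frozenset(p)] for p in zip(tour, tour[1:]))

print(tsp_upper_bound(list("abcde"), W))
```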


4.6 Eulerian Digraphs 


A digraph or directed graph is a graph in which each edge is assigned a 
direction, indicated by an arrow. In place of the degree of a vertex we have the 
indegree, the number of edges directed towards the vertex, and the outde- 
gree, the number of edges directed away from the vertex. 


Example 4.9 
Figure 4.13


In Figure 4.13, the indegrees of a,... , f are respectively 1,2, 2,1, 2,0, and the 
outdegrees are 1,1,1,0,2,3. It should be clear why the sum of the indegrees 
equals the sum of the outdegrees. 

An eulerian circuit in a digraph is exactly what we would expect; it has to 
follow the directions of the arrows at each stage. If every vertex has its indegree 
equal to its outdegree then, as in Lemma 4.3, the edge set can be partitioned 
as an edge-disjoint union of directed cycles, and, as in Theorem 4.4, we obtain: 


Theorem 4.6 


A connected digraph has an eulerian circuit if and only if each vertex has its 
indegree and outdegree equal. 


Memory wheels 


It is said that the meaningless Sanskrit word


yamátárájabhánasalagám


has been used as a memory aid by Indian drummers. It has in it every 3-tuple of 
accented and unaccented vowels, each 3-tuple appearing once. We can display 
this by replacing unaccented vowels by 0 and accented vowels by 1, to obtain 


0111010001. (4.4)


The 3-tuples 011, 111, 110, 101, 010, 100, 000, 001 appear in it in this order. Note
that the last two digits of (4.4) are the same as the first two, so we can obtain
a “memory wheel” by overlapping the ends as shown in Figure 4.14.


Figure 4.14


Now this arrangement achieves what a Gray code achieved, but much more
efficiently. A sensor placed at the edge of the wheel can read off triples of
digits and thereby determine how far the wheel has rotated. A Gray code for
8 positions would require three circles of 8 digits, i.e. 24 digits, whereas the
memory wheel uses only 8.

We now try to generalise this idea: can a circular arrangement of $2^n$ binary
digits be found which includes all $2^n$ n-digit binary sequences? One approach
might be via hamiltonian cycles. Since, in the above example, 110 is followed
by 101, and 101 by 010, we could take the triples xyz as the vertices of a graph
and join xyz and yzw by an edge to obtain the directed graph of Figure 4.15.


Figure 4.15


The directed hamiltonian cycle


000 → 001 → 011 → 111 → 110 → 101 → 010 → 100 → 000

yields the memory wheel of Figure 4.14. The trouble with this approach, however,
is that it is not at all easy to see how to obtain a hamiltonian cycle in the
corresponding digraph when $n \ge 4$.

The problem was however solved by I.J. Good, in a 1946 paper in number
theory. Instead of taking the triples as the vertices, Good took the triples as
the edges of a graph, in which the vertices corresponded to the overlapping
2-tuples. So, for n = 3, we form the digraph of Figure 4.16.


Figure 4.16


Now in this digraph all vertices have indegree and outdegree equal, so the
digraph contains an eulerian circuit. One such circuit consists of the edges


000 → 001 → 011 → 111 → 110 → 101 → 010 → 100 → 000,


and this gives the same memory wheel as before. 

In general, take as vertices the $(n-1)$-digit binary sequences, and draw a
directed edge from $x_1x_2 \ldots x_{n-1}$ to $x_2 \ldots x_{n-1}x_n$, labelling the edge $x_1x_2 \ldots x_n$.
The resulting digraph has an eulerian circuit which yields a memory wheel.


Example 4.10 


Obtain a memory wheel containing all 16 4-digit binary sequences. 


Solution 


Construct a digraph with 8 vertices labelled by the eight 3-digit binary
sequences, and draw a directed edge from $x_1x_2x_3$ to $x_2x_30$ and to $x_2x_31$. The
digraph of Figure 4.17 is obtained.

An eulerian circuit is (in terms of vertices)


000 → 000 → 001 → 011 → 111 → 111 → 110 → 101
→ 011 → 110 → 100 → 001 → 010 → 101 → 010 → 100 → 000

i.e. in terms of edges,

0000 → 0001 → 0011 → 0111 → 1111 → 1110 → 1101 → 1011
→ 0110 → 1100 → 1001 → 0010 → 0101 → 1010 → 0100 → 1000 → 0000.


Figure 4.18


The problem of constructing memory wheels is also known as the rotating drum
problem. The circular binary sequences are often called maximum length
shift register sequences, or de Bruijn sequences after the Dutch mathematician
N.G. de Bruijn who wrote about them in 1946 (although it turned out
that they had been constructed many years before by C. Flye Sainte-Marie).
They have been used worldwide in telecommunications, and there have been
recent applications in biology.
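
Good's construction is easily carried out by machine. The following Python sketch, in which the name memory_wheel is ours and n is assumed to be at least 2, finds an eulerian circuit in the digraph whose vertices are the $(n-1)$-digit sequences, and reads off one digit for each edge. The wheel it produces need not be the one found in Example 4.10, since a digraph of this kind has many eulerian circuits.

```python
def memory_wheel(n):
    """Circular sequence of 2**n digits containing every n-digit word once."""
    start = "0" * (n - 1)
    out = {}  # digits not yet used on edges leaving each vertex
    stack, circuit = [start], []
    while stack:
        v = stack[-1]
        digits = out.setdefault(v, ["1", "0"])
        if digits:
            d = digits.pop()
            # The edge labelled v + d leads to the last n - 1 digits of v + d.
            stack.append((v + d)[1:])
        else:
            circuit.append(stack.pop())
    circuit.reverse()  # the circuit is built backwards in a digraph
    # Each edge contributes the final digit of the vertex it enters.
    return "".join(v[-1] for v in circuit[1:])

wheel = memory_wheel(4)
print(wheel, len(wheel))
```

Reading the output round a circle, each of the 16 blocks of four consecutive digits is different.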


Exercises 


Exercise 4.1 


(a) Strengthen Theorem 4.1 to: if a bipartite graph, with bipartition
$V = B \cup W$, is hamiltonian, then |B| = |W|.
(b) Deduce that $K_{m,n}$ is hamiltonian if and only if $m = n \ge 2$.


Exercise 4.2


For each graph in Figure 4.19, determine whether (a) it is hamiltonian, 
(b) it is eulerian, (c) it is semi-eulerian. 


Figure 4.19 (graphs (i), (ii) and (iii))


Exercise 4.3 


Which of the platonic solid graphs are (a) hamiltonian, (b) eulerian? 


Exercise 4.4 


Use the planarity algorithm to determine whether or not the graphs in 
Figure 4.20 are planar. 


Exercise 4.5 


Construct a Gray code of order 5. 


Exercise 4.6 


Dirac’s theorem. Prove Theorem 4.2 as follows. Suppose G is not
hamiltonian. By adding edges we can assume that G is “almost” hamiltonian
in the sense that the addition of any further edge will give a
hamiltonian graph. So G has a path $v_1 \to v_2 \to \cdots \to v_p$ through every
vertex, where $v_1$ and $v_p$ are not adjacent. Show that there must be a vertex
$v_i$ adjacent to $v_1$, with $v_{i-1}$ adjacent to $v_p$. This gives a hamiltonian
cycle $v_1 \to \cdots \to v_{i-1} \to v_p \to v_{p-1} \to \cdots \to v_{i+1} \to v_i \to v_1$.


Figure 4.20


Exercise 4.7 


(a) Ore’s theorem. Imitate the proof of Dirac’s theorem to show that
if G is a simple graph with $p \ge 3$ vertices, with $\deg(v) + \deg(w) \ge p$
for each pair of non-adjacent vertices v, w, then G is hamiltonian.

(b) Deduce that if G has $2 + \frac{1}{2}(p-1)(p-2)$ edges then G is hamiltonian.

(c) Find a non-hamiltonian graph with $1 + \frac{1}{2}(p-1)(p-2)$ edges.


Exercise 4.8 


By removing vertex A, find a lower bound for the TSP for the graph of
Exercise 3.7. Repeat, removing vertex B. Then obtain an upper bound 
by the method of Section 4.5. 


Exercise 4.9 


Find upper and lower bounds for the TSP for the situation in Exercise 
3.8. How do your results compare with the exact solution? 


Exercise 4.10


Construct a memory wheel containing all 32 5-digit binary sequences. 


Exercise 4.11 


Use digraphs to construct a memory wheel of length 9 containing all 
2-digit ternary sequences (formed from the digits 0,1, 2). Then find one 
for all 3-digit ternary sequences. 


Exercise 4.12 


Dominoes. Can you arrange the 28 dominoes of an ordinary set in a 
closed loop, so that each matches with its neighbour in the usual way? 
Can you do so if all dominoes with a 6 on them are removed? Can you 
state a general theorem about dominoes with numbers 0,1,...,n on 
them? (Hint: consider each domino as an edge of a graph with vertices 
labelled 0, 1, ..., n.)


Exercise 4.13 


Figure 4.21 shows an arrangement of the numbers 1,... ,5 round a circle, 
so that each number is adjacent to every other number exactly once. Can 
you produce a similar arrangement for 1,... ,7? Use Euler’s theorem to 
show that there is a solution for n numbers if and only if n is odd. Can 
you salvage a similar type of result when n is even? 


Figure 4.21 


5


Partitions and Colourings 


In this chapter we consider partitions of a set, introducing the Stirling numbers 
and the Bell numbers. We then consider vertex and edge colourings of a graph, 
where the vertex set and the edge set are partitioned by the colours. 


5.1 Partitions of a Set 


A partition of a set S is a collection of non-empty subsets $S_1, \ldots, S_r$ of S
which are pairwise disjoint and whose union is S. The subsets $S_i$ are called
the parts of the partition. For example $\{1,2,4\} \cup \{3,6\} \cup \{5\}$ is a partition
of $\{1, \ldots, 6\}$ into three parts. Note that it does not matter in what order the
parts are written.


Example 5.1


In a game of bridge, the 52 cards of a standard pack are distributed among 
four people who receive 13 cards each. In how many ways can the pack of 52 
cards be partitioned into four sets of size 13?


Solution 


We can choose 13 cards in $\binom{52}{13}$ ways. From the remaining 39 we can choose a
further 13 in $\binom{39}{13}$ ways, and then from the remaining 26 we can choose 13 in
$\binom{26}{13}$ ways. This leaves a final set of 13 cards. So we have


$\binom{52}{13}\binom{39}{13}\binom{26}{13} = \frac{52!}{13!\,39!} \cdot \frac{39!}{13!\,26!} \cdot \frac{26!}{13!\,13!} = \frac{52!}{(13!)^4}$

ways of partitioning the pack. But these partitions are not all distinct, since
each distinct partition arises in 4! ways, depending on which of the four sets in
it is chosen first, which is chosen second, and so on. So the required number is

$\frac{52!}{(13!)^4\,4!}$


(a vast number, greater than $10^{27}$).


There is another way of approaching this counting problem. Consider a row of
52 spaces grouped into four groups of 13.


The cards can be placed in the spaces in 52! ways. Within each group there are
13! ways of arranging the same 13 cards, and these different arrangements are
irrelevant since they give rise to the same part of the partition, so we have to
divide by $(13!)^4$, one 13! for each group. Then the four groups themselves can
be arranged in 4! ways, so we have to divide by 4!, giving the same answer as
before.


This argument easily generalises, to give the following result. 


Theorem 5.1 
A set of mn objects can be partitioned into m sets of size n in

$\frac{(mn)!}{(n!)^m\, m!}$


different ways.


Corollary 5.2 


A set of 2m objects can be partitioned into m pairs in

$\frac{(2m)!}{2^m\, m!}$

different ways.


Example 5.2 

The number of ways of pairing 16 teams in a football cup draw is

$\frac{16!}{2^8\, 8!} = 2027025$.
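

The formula of Theorem 5.1 involves very large factorials, but exact integer arithmetic copes with them easily. A minimal sketch in Python follows; the function name equal_partitions is ours.

```python
from math import factorial

def equal_partitions(m, n):
    """Number of ways of splitting mn objects into m unlabelled sets of size n."""
    return factorial(m * n) // (factorial(n) ** m * factorial(m))

bridge = equal_partitions(4, 13)   # Example 5.1: four hands of 13 cards
cup_draw = equal_partitions(8, 2)  # Example 5.2: 16 teams in 8 pairs
print(bridge)
print(cup_draw)  # prints 2027025
```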


The same type of argument can be applied when the parts of the required 
partition are not all of the same size. 


Example 5.3 


In how many ways can a class of 25 pupils be placed into four tutorial groups
of size 3, two of size 4 and one of size 5?


Solution


Consider the following grouping of 25 spaces


(- - -)(- - -)(- - -)(- - -)(- - - -)(- - - -)(- - - - -).


The 25 pupils can be placed in the spaces in 25! ways. To count distinct partitions
we have to take into account the ways of ordering the pupils within the groups,
so we divide by $(3!)^4(4!)^2\,5!$, and also the ways of ordering the groups themselves,
so we divide by 4! on account of the four groups of size 3 and by 2! on account
of the two groups of size 4. So the required number is


$\frac{25!}{(3!)^4(4!)^2\,5!\,4!\,2!} \approx 3.6 \times 10^{15}$.


Definition 5.1 


A partition of an n-element set consisting of $a_i$ subsets of size i, $1 \le i \le n$,
where $\sum_{i=1}^{n} i a_i = n$, is called a partition of type $1^{a_1}2^{a_2} \ldots n^{a_n}$.


Generalising Example 5.3 gives the following result. 


Theorem 5.3 
The number of partitions of type $1^{a_1}2^{a_2} \ldots n^{a_n}$ of an n-element set is

$\frac{n!}{\prod_{i=1}^{n} (i!)^{a_i}\, a_i!}$.


Example 5.4 


The number of ways of grouping 10 people into two groups of size 3 and one
group of size 4 is the number of partitions of type $3^2 4^1$ and so is


$\frac{10!}{(3!)^2\,4!\,2!} = 2100$.
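

Theorem 5.3 can be evaluated directly once the type is known. In the following Python sketch the type is passed as a dictionary sending each part size i to the number $a_i$ of parts of that size (sizes which do not occur are simply omitted); the function name partitions_of_type and this way of recording the type are our own choices.

```python
from math import factorial, prod

def partitions_of_type(a):
    """a maps a part size i to the number a_i of parts of that size."""
    n = sum(i * a_i for i, a_i in a.items())
    return factorial(n) // prod(factorial(i) ** a_i * factorial(a_i)
                                for i, a_i in a.items())

print(partitions_of_type({3: 2, 4: 1}))        # Example 5.4: prints 2100
print(partitions_of_type({3: 4, 4: 2, 5: 1}))  # Example 5.3
```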


5.2 Stirling Numbers


In this section we think about partitioning a set into a given number of parts.


Definition 5.2 


Let S(n, k) denote the number of ways of partitioning an n-set into exactly k
parts. Then S(n,k) is called a Stirling number of the second kind.

These numbers are named after the Scottish mathematician James Stirling
(1692-1770), who is also known for his approximation of n!:


$n! \sim \sqrt{2\pi n}\, n^n e^{-n}$.


Stirling also has numbers of the first kind named after him (see Exercise 5.10).
We now study S(n,k). Clearly, for all $n \ge 1$,


S(n,1) = S(n,n) = 1. (5.1) 


Example 5.5 


We show that S(4,2) = 7. Here are the seven ways of partitioning {1, 2, 3, 4}
into two parts: $\{1\} \cup \{2,3,4\}$, $\{2\} \cup \{1,3,4\}$, $\{3\} \cup \{1,2,4\}$, $\{4\} \cup \{1,2,3\}$,
$\{1,2\} \cup \{3,4\}$, $\{1,3\} \cup \{2,4\}$ and $\{1,4\} \cup \{2,3\}$.


Clearly, for large n, we need a better way of evaluating S(n, k) than just writing 
down all possible partitions. Such a method is given by the following recurrence 
relation. 


Theorem 5.4 
$S(n,k) = S(n-1,k-1) + k\,S(n-1,k)$ (5.2)


whenever 1 < k < n.


Proof 

In any partition of $\{1, \ldots, n\}$ into k parts, the element n may appear by itself
as a 1-element subset or it may occur in a larger set. If it appears by itself,
then the remaining $n - 1$ elements have to form a partition of $\{1, \ldots, n-1\}$
into $k - 1$ subsets, and there are $S(n-1, k-1)$ ways in which this can be done.
On the other hand, if the element n is in a set of size at least two, we can think
of partitioning $\{1, \ldots, n-1\}$ into k sets (this can be done in $S(n-1, k)$ ways)
and then of introducing n into one of the k sets so formed (and there are k
ways of doing this). So, by the addition and multiplication principles, we have
$S(n,k) = S(n-1,k-1) + k\,S(n-1,k)$.
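

The recurrence (5.2), together with the boundary values (5.1), gives a practical way of computing S(n, k). A minimal sketch in Python follows; the function name stirling2 is ours, and we take S(n, k) = 0 when k < 1 or k > n so that the function is defined for all arguments.

```python
from functools import lru_cache

@lru_cache(maxsize=None)
def stirling2(n, k):
    """Stirling number of the second kind, computed from (5.1) and (5.2)."""
    if k < 1 or k > n:
        return 0
    if k == 1 or k == n:
        return 1
    return stirling2(n - 1, k - 1) + k * stirling2(n - 1, k)

print(stirling2(4, 2))  # prints 7
print([stirling2(8, k) for k in range(1, 9)])
```

The decorator lru_cache stores values already found, so that each S(n, k) is computed only once.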


Example 5.5 (again)

$S(4,2) = S(3,1) + 2S(3,2)$
$= 1 + 2(S(2,1) + 2S(2,2))$
$= 1 + 2(1 + 2) = 7$.


Theorem 5.5


For all $n \ge 2$, $S(n,2) = 2^{n-1} - 1$.


Proof


We use induction on n. The result is true for n = 2, so suppose it is true for
$n = k \ge 2$. Then


$S(k+1, 2) = S(k,1) + 2S(k,2)$ (by (5.2))
$= 1 + 2(2^{k-1} - 1)$
$= 1 + 2^k - 2 = 2^{(k+1)-1} - 1$.


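Table 5.1 gives the first few Stirling numbers S(n, k).


Table 5.1
n\k |  1    2    3     4     5    6    7   8 | B(n)
 1  |  1                                     |    1
 2  |  1    1                                |    2
 3  |  1    3    1                           |    5
 4  |  1    7    6     1                     |   15
 5  |  1   15   25    10     1               |   52
 6  |  1   31   90    65    15    1          |  203
 7  |  1   63  301   350   140   21    1     |  877
 8  |  1  127  966  1701  1050  266   28   1 | 4140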


Note the number $2^{n-1} - 1$ in the column k = 2. On the right of the table
are the sums B(n) of all the Stirling numbers in the rows. B(n) is the total
number of partitions of an n-set, and is called a Bell number, after another
Scot, E.T. Bell, who emigrated to the USA and wrote several popular books
on mathematics, including Men of Mathematics, an idiosyncratic two-volume
collection of “biographies” of famous mathematicians. We have, for $n \ge 1$,


$B(n) = \sum_{k=1}^{n} S(n,k)$. (5.3)


If we define B(0) = 1 = S(0,0) (accept this as a useful convention, like $\binom{0}{0} = 1$)
we can obtain a recurrence relation for the Bell numbers.


Theorem 5.6


For all $n \ge 1$, $B(n) = \sum_{k=0}^{n-1} \binom{n-1}{k} B(k)$.


Proof


The nth element of the set being partitioned will appear in one of the sets of
the partition along with $j \ge 0$ other elements. There are $\binom{n-1}{j}$ ways of choosing
these j elements. The remaining $n - 1 - j$ elements can then be partitioned in
$B(n-1-j)$ ways. So


$B(n) = \sum_{j=0}^{n-1} \binom{n-1}{j} B(n-1-j) = \sum_{k=0}^{n-1} \binom{n-1}{k} B(k)$,

on putting $k = n - 1 - j$ and using $\binom{n-1}{j} = \binom{n-1}{n-1-j}$.


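Example 5.6
\begin{align*}
B(9) &= \sum_{k=0}^{8} \binom{8}{k} B(k) \\
&= 1 + 8\cdot 1 + 28\cdot 2 + 56\cdot 5 + 70\cdot 15 + 56\cdot 52 + 28\cdot 203 + 8\cdot 877 + 1\cdot 4140 \\
&= 21147.
\end{align*}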
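The recurrence for the Bell numbers can be checked mechanically. The sketch below (the function name `bell_numbers` is ours) builds the list $B(0), \ldots, B(9)$ from the convention $B(0) = 1$ and the sum over binomial coefficients.

```python
from math import comb


def bell_numbers(n_max: int) -> list[int]:
    """Return [B(0), ..., B(n_max)] using B(n) = sum_{k<n} C(n-1, k) B(k)."""
    bell = [1]  # convention B(0) = 1
    for n in range(1, n_max + 1):
        bell.append(sum(comb(n - 1, k) * bell[k] for k in range(n)))
    return bell


print(bell_numbers(9))
# [1, 1, 2, 5, 15, 52, 203, 877, 4140, 21147]
```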


For an interesting (but useless!) formula for B(n), see Exercise 5.9. 


5.3 Counting Functions 


The Stirling numbers arise naturally in the enumeration of all functions
$f : X \to Y$ which can be defined from an $m$-set $X$ to an $n$-set $Y$. There
are $n^m$ such functions since, for each $x \in X$, there are $n$ possible values for
$f(x)$.

Recall that the image of $f : X \to Y$ is the set of elements of $Y$ which actually
arise as a value $f(x)$ for some $x \in X$:
\[ \operatorname{im} f = \{\, y \in Y : y = f(x) \text{ for some } x \in X \,\}. \]


Each function $f : X \to Y$ has as its image a subset of $Y$. How many such
functions have an image of size $k$? If $f$ takes precisely $k$ values then $X$ can be
partitioned into $k$ parts, the $i$th of which will consist of those elements of $X$
which are mapped onto the $i$th member of $\operatorname{im} f$. So a function $f : X \to Y$
with image of size $k$ can be constructed as follows:

(i) partition $X$ into $k$ parts $X_1,\ldots,X_k$ (this can be done in $S(m,k)$ ways);

(ii) choose the image set of size $k$ in $Y$ (this can be done in $\binom{n}{k}$ ways);

(iii) pair off each $X_i$ with one of the members of the image set (this can be done
in $k!$ ways).

So the number of functions $f : X \to Y$ with image of size $k$ is $S(m,k)\binom{n}{k}k!$.
Thus, since $k$ can take any value from 1 to $n$, and there are $n^m$ functions
$f : X \to Y$ altogether, we obtain:


Theorem 5.7
Let $|X| = m$ and $|Y| = n$ where $m, n \ge 1$.

(a) The number of functions $f : X \to Y$ with image of size $k$ is $S(m,k)\binom{n}{k}k!$.

(b)
\[ n^m = \sum_{k=1}^{n} S(m,k)\binom{n}{k}k!. \tag{5.4} \]


Note as a special case that the number of surjections from X to Y, i.e. func- 
tions whose image set is the whole of Y, is n!S(m,n). 


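Example 5.7
We check (5.4) in the case $n = 4$, $m = 5$.
\begin{align*}
\sum_{k=1}^{4} S(5,k)\binom{4}{k}k! &= 4S(5,1) + 12S(5,2) + 24S(5,3) + 24S(5,4) \\
&= 4\cdot 1 + 12\cdot 15 + 24\cdot 25 + 24\cdot 10 = 1024 = 4^5.
\end{align*}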

Note that if we define $S(m,0) = 0$ for all $m \ge 1$, and $S(0,0) = 1$, then we can
rewrite (5.4) as
\[ n^m = \sum_{k=0}^{n} S(m,k)\binom{n}{k}k!. \]
This identity can be inverted.


Theorem 5.8
For all $m \ge 1$, $n \ge 0$, $m \ge n$,
\[ n!\,S(m,n) = \sum_{k=0}^{n} (-1)^{n-k}\binom{n}{k}k^m. \tag{5.5} \]


Proof
We can use Corollary 1.15, putting $a_k = k^m$ and $b_k = S(m,k)k!$.


Alternatively we shall be able to use the inclusion-exclusion principle in the 
next chapter: see Section 6.2. 


Example 5.8
\[ S(5,3) = \frac{1}{3!}\sum_{k=0}^{3} (-1)^{3-k}\binom{3}{k}k^5 = \frac{1}{6}\left(-0 + 3 - 3\cdot 2^5 + 3^5\right) = 25. \]
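Identities (5.4) and (5.5) are easy to test numerically. In the sketch below the helper names are ours; `stirling2_explicit` evaluates $S(m,n)$ from (5.5), and `functions_by_image_size` lists the counts $S(m,k)\binom{n}{k}k!$ of Theorem 5.7(a), whose total should be $n^m$.

```python
from math import comb, factorial


def stirling2_explicit(m: int, n: int) -> int:
    """S(m, n) from (5.5): n! S(m, n) = sum_k (-1)^(n-k) C(n, k) k^m."""
    total = sum((-1) ** (n - k) * comb(n, k) * k ** m for k in range(n + 1))
    return total // factorial(n)  # the sum is always an exact multiple of n!


def functions_by_image_size(m: int, n: int) -> list[int]:
    """Counts of functions from an m-set to an n-set with image size 1, ..., n."""
    return [stirling2_explicit(m, k) * comb(n, k) * factorial(k)
            for k in range(1, n + 1)]


print(stirling2_explicit(5, 3))              # 25, as in Example 5.8
print(functions_by_image_size(5, 4))         # [4, 180, 600, 240]
print(sum(functions_by_image_size(5, 4)))    # 1024 = 4**5
```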


5.4 Vertex Colourings of Graphs 


To colour the vertices of a graph G is to assign a colour to each vertex in such 
a way that no two adjacent vertices receive the same colour. If we define an 
independent set of vertices of G to be a set of vertices no two of which are 
adjacent, then a vertex colouring can be thought of as a partition of the set V 
of vertices into independent subsets. Often we are concerned with the smallest 
number of colours required, i.e. the smallest number of independent sets which 
partition V; we call this number the chromatic number of G. 


Definition 5.3 


The chromatic number $\chi(G)$ of a graph $G$ is the smallest value of $k$ for which
the vertex set of $G$ can be partitioned into $k$ independent subsets.


We have met the idea of colouring vertices already; in Section 3.5 we noted that
bipartite graphs are bichromatic; so if $G$ is bipartite with at least one edge then
$\chi(G) = 2$. Also, the four colour theorem asserts that $\chi(G) \le 4$ for all planar
graphs $G$.


Theorem 5.9
(i) $\chi(K_n) = n$.
(ii) $\chi(C_n) = 2$ if $n$ is even; $\chi(C_n) = 3$ if $n$ is odd.


Proof
(i) No two vertices can receive the same colour since they are adjacent.


(ii) If $n$ is even, we can alternate colours round the cycle; if $n$ is odd we need
a third colour for the “last” vertex coloured.


Example 5.9


The graph of Figure 5.1(a) has chromatic number 3; it needs at least three
colours since it contains $C_3$, and three colours are sufficient, as shown in
Figure 5.1(b).

[Figure 5.1: graphs (a), (b) and (c)]

Note that the case of $C_n$, $n$ odd, contradicts the belief of some amateur
four-colour-theorem-provers, that a graph needs $m$ colours only if it contains $K_m$ as
a subgraph. Another counterexample to this belief is the graph of Figure 5.1(c)
which needs four colours (why?) although it does not contain $K_4$.

There is no easy way of finding $\chi(G)$ for a given graph $G$. The greedy
algorithm, which we now describe, will give an upper bound for $\chi(G)$ related to the
maximum vertex degree. In our description of the algorithm we denote colours
by $C_1, C_2, C_3, \ldots$ and call $C_i$ the $i$th colour.


The greedy algorithm for vertex colouring


1. List the vertices in some order: $v_1, \ldots, v_p$.

2. Assign colour $C_1$ to $v_1$.

3. At stage $i+1$, when $v_i$ has just been assigned a colour, assign to $v_{i+1}$ the
colour $C_j$ with $j$ as small as possible which has not yet been used to colour
a vertex adjacent to $v_{i+1}$.
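The three steps of the greedy algorithm can be sketched in a few lines. Everything specific here is an assumption made for illustration: the graph is given as a dictionary of adjacency lists, colours are the integers 1, 2, 3, ..., and the test graph is a 6-cycle rather than any graph from the text.

```python
def greedy_colouring(adjacency, order):
    """Colour the vertices in the given order with the smallest eligible colour."""
    colour = {}
    for v in order:
        # colours already used on neighbours of v
        used = {colour[u] for u in adjacency[v] if u in colour}
        c = 1
        while c in used:
            c += 1
        colour[v] = c
    return colour


# Hypothetical example: the cycle 1-2-3-4-5-6-1
cycle6 = {1: [2, 6], 2: [1, 3], 3: [2, 4], 4: [3, 5], 5: [4, 6], 6: [5, 1]}
print(greedy_colouring(cycle6, [1, 2, 3, 4, 5, 6]))  # uses colours 1 and 2 only
print(greedy_colouring(cycle6, [1, 4, 2, 5, 3, 6]))  # needs a third colour
```

The two runs show how strongly the result depends on the vertex ordering, even for a bipartite graph.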


Example 5.10 


We use the greedy algorithm to colour the graph of Figure 5.2 for each of the 
two vertex orderings shown. 
With vertices listed as in (a), we assign colours as follows: 


v: 1 2 3 4 5 6 7
C: 1 2 1 3 4 1 2


This colouring uses four colours. However, with the vertices labelled as in (b),
we get:

v: 1 2 3 4 5 6 7
C: 12 3 1 3 «21 2


[Figure 5.2: the graph with vertex orderings (a) and (b)]


This second colouring shows that $\chi(G) \le 3$; in fact $\chi(G) = 3$ since $G$ is not
bipartite.

Clearly, the bound for $\chi(G)$ obtained by the algorithm depends on the
particular order in which the vertices are considered. But note that, if a vertex $v$
has degree $d$ then, when it is the turn of $v$ to be assigned a colour, at most $d$ of
the colours are ineligible, so it must be given some colour $C_i$ where $i \le d+1$.
Thus we have the following bound.


Theorem 5.10


If $G$ has maximum vertex degree $\Delta$, then the greedy algorithm will colour the
vertices of $G$ using at most $\Delta + 1$ colours, so that $\chi(G) \le \Delta + 1$.


Example 5.11 (A timetabling problem) 


The University of Central Caledonia has nine vice-principals, Professors
A, B, ..., I, who serve on eight committees. The memberships of the
committees are as follows.


Committee 1: A, B, C, D      5: A, H, J
          2: A, C, D, E      6: H, I, J
          3: B, D, F, G      7: G, H, J
          4: C, F, G, H      8: E, I

Each committee is to meet for a day; no two committees with a member in 
common can meet on the same day. Find the smallest number of days in which 
the meetings can take place. 


Solution 


Represent each committee by a vertex, and join two vertices by an edge precisely
when the corresponding committees have overlapping membership. Then the
minimum number of days required is the chromatic number of the graph $G$,
shown in Figure 5.3. Note that vertices 1, 2, 3, 4 form a $K_4$, so at least four
colours (days) are needed. But four colours are sufficient: e.g.
\[ \{1,7,8\} \cup \{3,5\} \cup \{2,6\} \cup \{4\} \]
is a partition of $\{1,\ldots,8\}$ into independent sets. So $\chi(G) = 4$, and four days
are enough.

[Figure 5.3: the committee graph $G$ on vertices 1, ..., 8]


5.5 Edge Colourings of Graphs 


An edge colouring of a graph $G$ is an assignment of colours to the edges of
$G$ so that no two edges with a common vertex receive the same colour. The
minimum number of colours required in an edge colouring of $G$ is called the
chromatic index of $G$ and is denoted by $\chi'(G)$.

Thus to edge colour a graph is to partition the edge set into subsets such that
no two edges in the same subset have a vertex in common, i.e. so that all edges
in any part of the partition are disjoint. A set of disjoint edges in a graph is
often called a matching. Clearly, in an edge colouring, all edges at a vertex $v$
must receive different colours, so $\chi'(K_n) \ge n-1$ for each $n$.


Example 5.12


(a) $\chi'(K_4) = 3$, since $\chi'(K_4) \ge 3$ and three colours suffice, as shown in Figure
5.4(a).

(b) $\chi'(K_5) = 5$. Here, $\Delta = 4$ colours are not enough, for there are 10 edges and
no more than two edges in any matching. However, 5 colours are enough,
as shown in Figure 5.4(b).

[Figure 5.4: edge colourings of (a) $K_4$ and (b) $K_5$]


Theorem 5.11
(i) $\chi'(K_n) = n$ if $n$ is odd.


(ii) $\chi'(K_n) = n-1$ if $n$ is even.


Proof 


(i) If $n$ is odd, any matching in $K_n$ can have at most $\frac12(n-1)$ edges. So at most
$\frac12(n-1)$ edges can be given any one colour. But there are $\frac12 n(n-1)$ edges
in $K_n$, so at least $n$ colours are needed. Now we can colour the edges using
$n$ colours in the following way. Represent $K_n$ as a regular $n$-gon, with all
diagonals drawn. Colour the boundary edges by $1,\ldots,n$; then colour each
diagonal by the colour of the boundary edge parallel with it. This gives an
edge colouring using $n$ colours. The case $n = 5$ is as in Figure 5.4(b).

(ii) Now suppose $n$ is even. Certainly $\chi'(K_n) \ge n-1$; we show how to use only
$n-1$ colours. Since $n-1$ is odd, we can colour $K_{n-1}$ using $n-1$ colours, as
described above. Now take another vertex $v$ and join each vertex of $K_{n-1}$
to $v$, thus obtaining $K_n$. At each vertex of $K_{n-1}$, one colour has not been
used. The colours missing at each vertex of $K_{n-1}$ are all different, so we
can use these $n-1$ colours to colour the added edges at $v$. This gives an
edge colouring of $K_n$ using $n-1$ colours.
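The construction in this proof can be carried out explicitly. The sketch below rests on one assumption of ours, not stated in the text: if the vertices of the regular $n$-gon are labelled $0, \ldots, n-1$ in order, then two chords $\{i,j\}$ and $\{k,l\}$ are parallel exactly when $i + j \equiv k + l \pmod n$, so $(i+j) \bmod n$ can serve as the colour. The function name is hypothetical.

```python
from itertools import combinations


def edge_colour_complete(n):
    """Edge-colour K_n on vertices 0..n-1; returns {(i, j): colour} with i < j."""
    if n % 2 == 1:
        # parallel chords of the regular n-gon share the value (i + j) mod n
        return {(i, j): (i + j) % n for i, j in combinations(range(n), 2)}
    # n even: colour K_{n-1} as above, then join the extra vertex n-1 to each
    # vertex i using the colour 2i mod (n-1), which is the one missing at i
    colouring = edge_colour_complete(n - 1)
    for i in range(n - 1):
        colouring[(i, n - 1)] = (2 * i) % (n - 1)
    return colouring


print(len(set(edge_colour_complete(5).values())))  # 5 colours for K_5
print(len(set(edge_colour_complete(6).values())))  # 5 colours for K_6
```

In the even case the colours $2i \bmod (n-1)$ are all different because $n-1$ is odd, which is the observation used in part (ii).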

The appearance of $\Delta\ (= n-1)$ and $\Delta + 1\ (= n)$ as the chromatic indices of $K_n$,
according as $n$ is even or odd, is in accordance with the following result.


Theorem 5.12 (Vizing, 1964)


If $G$ is a simple graph with maximum vertex degree $\Delta$, then $\chi'(G) = \Delta$ or
$\Delta + 1$.


We omit the proof of this result; a proof can be found in [9]. But we include the
statement of the result because it has led to a great deal of work on determining
which graphs are class one graphs, i.e. satisfy $\chi'(G) = \Delta$, and which are class
two, i.e. satisfy $\chi'(G) = \Delta + 1$.


Example 5.13


The Petersen graph is class 2. Here $\Delta = 3$, so we have to show that $\chi'(G) \ne 3$.
So suppose an edge colouring using only three colours exists. Then the outer
5-cycle uses three colours, and, without loss of generality, we can assume that
it is coloured as in Figure 5.5(a). The spokes are then uniquely coloured, as in
Figure 5.5(b). But this leaves two adjacent inside edges which have to be given
colour 2. So there is no edge colouring with three colours.

[Figure 5.5: (a), (b)]


We close this section by establishing that all bipartite graphs are class 1. This
result is due to König, the Hungarian author of the first major book on graph
theory [14].


Proof 


Proceed by induction on $q$, the number of edges. The theorem is clearly true
for graphs with $q = 1$; so suppose it is true for all bipartite graphs with $k$
edges, and consider a bipartite graph $G$ with maximum vertex degree $\Delta$ and
with $k+1$ edges. Choose any edge $vw$ of $G$, and remove it, thereby forming a
new bipartite graph $H$. $H$ has $k$ edges and maximum vertex degree $\le \Delta$, so,
by the induction hypothesis, $H$ can be edge coloured using at most $\Delta$ colours.

Now, in $H$, $v$ and $w$ both have degree $\le \Delta - 1$, so there is at least one colour
missing from the edges from $v$, and at least one missing from the edges of $w$.
If there is a colour missing at both vertices then it can be used to colour edge
$vw$. If there is no colour missing from both, then let $C_1$ be a colour missing at
$v$, and $C_2$ a colour missing at $w$. Now there is some edge, say $vu$, coloured $C_2$;
if there is an edge coloured $C_1$ from $u$, go along it, and continue along edges
coloured $C_1$ and $C_2$ alternately as far as possible. The path so constructed will
never reach $w$ since if it did it would have to reach $w$ along an edge coloured
$C_1$ and so would be a path of even length, giving, with edge $vw$, an odd cycle
in a bipartite graph. So the connected subgraph $K$, consisting of vertex $v$ and
all vertices and edges of $H$ which can be reached by a path of edges coloured
$C_1$ and $C_2$, does not contain $w$. So we can interchange the colours $C_1$ and $C_2$
in $K$ without interfering with the colours in the rest of $H$. This gives a new
edge colouring of $H$ in which $v$ and $w$ have no edge coloured $C_2$, and we can
use $C_2$ to colour $vw$.


This idea of swapping colours along a path was used by Kempe in his
unsuccessful 1879 attempt to prove the four colour theorem. Despite the fact that
it did not work there as Kempe had hoped, it has nevertheless proved to be a
very useful technique in graph theory.


Example 5.14 


Eight students need to consult certain library books. Each is to borrow each
required book for a week. The books $B_j$ required by each student $S_i$ are as
follows:


S,:B,,Bo,Bz; S29: Bo, Ba, Bs,Be  —S3 : Bo, Bs, Bs, By 
Sa 7 Bs, Bs Ss : By, Be, Br Se : Bo, Ba, Bg 
Sy : Ba, Bs, Br Ss . B3, Be. 


What is the minimum number of weeks required so that each student can 
borrow all books required? 


Solution 


Draw a bipartite graph $G$ with vertices labelled $B_1,\ldots,B_7,S_1,\ldots,S_8$, and
with $S_i$ joined by an edge to $B_j$ precisely when student $S_i$ has to consult
book $B_j$. Then $G$ has maximum vertex degree $\Delta = 4$, so, by König’s theorem,
$\chi'(G) = 4$. Thus four colours (weeks) are required. You should be able to
partition the set of edges into four disjoint matchings.


Exercises 


Exercise 5.1 


How many ways are there of arranging 16 football teams into four groups 
of four? 


Exercise 5.2


A class contains 30 pupils. For a chemistry project, the class is to be put 
into four groups, two of size 7 and two of size 8. In how many ways can 
this be done? 


Exercise 5.3 


In the early versions of the Enigma machine, used in Germany in the 
1930s, the plugboard swapped six pairs of distinct letters of the alphabet. 
In how many ways can this be done (assuming 26 letters)? 


Exercise 5.4 


Any permutation is a product of cycles. For example, the permutation
351642 ($3 \to 1$, $5 \to 2$, $1 \to 3$, $6 \to 4$, $4 \to 5$, $2 \to 6$) can be written
as (31)(2645). How many permutations of $1,\ldots,8$ are a product of a
1-cycle, two 2-cycles and a 3-cycle?


Exercise 5.5 
Prove that (a) $S(n,n-1) = \binom{n}{2}$, (b) $S(n,n-2) = \binom{n}{3} + 3\binom{n}{4}$.


Exercise 5.6 


Prove by induction that $S(n,3) > 3^{n-2}$ for all $n \ge 6$.


Exercise 5.7 


Show that $S(n,k) = \sum_{m=k-1}^{n-1} \binom{n-1}{m} S(m,k-1)$ and hence give another
proof of Theorem 5.6.


Exercise 5.8 
Find B(10). 


Exercise 5.9 


Use Theorem 5.6 and induction to prove that
\[ B(n) = \frac{1}{e}\sum_{j=0}^{\infty} \frac{j^n}{j!}. \]


Exercise 5.10


The (signless) Stirling numbers $s(n,k)$ of the first kind are defined by:
$s(n,k)$ is the number of permutations of $1,\ldots,n$ consisting of exactly
$k$ cycles. Verify that $s(2,1) = 1$, $s(3,1) = 2$, $s(3,2) = 3$, $s(4,2) = 11$ and
that $s(n,1) = (n-1)!$. Prove that $s(n,k) = (n-1)s(n-1,k) + s(n-1,k-1)$,
and deduce the value of $s(6,2)$.


Exercise 5.11 
Find $\chi(G)$ and $\chi'(G)$ for each of the graphs of Exercise 4.2.


Exercise 5.12 


Let $G$ be a graph with $p$ vertices and let $\alpha(G)$ denote the size of the
largest independent set of vertices of $G$. Show that $\chi(G)\alpha(G) \ge p$.


Exercise 5.13 


Apply the greedy vertex colouring algorithm to the graph of Figure 5.3,
taking the vertices (a) in the order $1,\ldots,8$, (b) in the order $8,\ldots,1$. Do
you get a colouring using four colours?


Exercise 5.14
As Exercise 5.13, but this time choose vertices in (a) increasing, (b)
decreasing order of vertex degrees. Which approach would you expect to
require fewer colours in general?


Exercise 5.15 


Explain why there is always an ordering of the vertices for which the
greedy algorithm will lead to a colouring with $\chi(G)$ colours.


Exercise 5.16 


Find the chromatic index of each of the five Platonic solid graphs.


Exercise 5.17


A graph in which every vertex degree is 3 is called a cubic graph. Prove 
that all hamiltonian cubic graphs have chromatic index 3. (Note how- 
ever that not all cubic graphs have chromatic index 3, e.g. the Petersen 
graph.) 


Exercise 5.18 


Let $G$ be a graph with an odd number $2k+1$ of vertices, each of
which has the same degree $r$.
(a) Show that $G$ has $(k+\frac12)r$ edges.

(b) Explain why no more than $k$ edges can have the same colour in any
edge colouring, and hence show that $\chi'(G) = r+1$. Thus every
regular graph with an odd number of vertices is class 2. (This includes
$K_n$, $n$ odd, as shown in Theorem 5.11.)

Exercise 5.19 


Let $f_\lambda(G)$ denote the number of ways of colouring the vertices of $G$ using
$\lambda$ given colours.

(a) Show that $f_\lambda(K_n) = \lambda(\lambda-1)(\lambda-2)\cdots(\lambda-n+1)$.

(b) Show that $f_\lambda(T) = \lambda(\lambda-1)^{n-1}$ for all trees $T$ with $n$ vertices.

(c) Let $xy$ be any edge of $G$. Let $G'$ be the graph obtained from $G$ by
removing the edge $xy$, and let $G''$ be the graph obtained by
identifying vertices $x$ and $y$. Then $f_\lambda(G) = f_\lambda(G') - f_\lambda(G'')$. Deduce that
$f_\lambda(G)$ is a polynomial in $\lambda$: it is called the chromatic polynomial
of $G$.

(d) Note that the solution $a_n = 2^n + (-1)^n 2$ of Example 2.4 can be
interpreted as: $f_3(C_n) = 2^n + (-1)^n 2$. By replacing 3 colours by $\lambda$
colours, show similarly that $f_\lambda(C_n) = (\lambda-1)^n + (-1)^n(\lambda-1)$. Note
that this gives $f_2(C_n) = 0$ whenever $n$ is odd, as expected!


The Inclusion-Exclusion Principle


In this chapter we discuss a method of counting which has been used for at 
least 300 years. One of its first uses was in the study of derangements; as well 
as this, we give many applications including labelled trees, scrabble and the 
ménage problem. 


6.1 The Principle 


The principle is essentially a generalisation of the following simple observation.
Suppose we are given two sets $A$ and $B$, and are asked for the number of
elements in their union. A first attempt might be $|A| + |B|$, but elements in
both $A$ and $B$ are counted twice; so the corrected estimate is
\[ |A \cup B| = |A| + |B| - |A \cap B|. \tag{6.1} \]
Note that we first include, and then exclude all those which have been
included too often.
Even this simplest form of the principle can be useful.


Example 6.1 


In a class of 50, there are 30 girls, and there are 35 students with dark hair. 
Show that there are at least 15 girls with dark hair. 


Solution 


Let $A$ denote the set of female students, and $B$ the set of students with dark
hair. Then
\[ |A \cap B| = |A| + |B| - |A \cup B| = 30 + 35 - |A \cup B| \ge 65 - 50 = 15, \]
since $|A \cup B| \le 50$.


The next application of (6.1) is much less trivial. 


Example 6.2 


We look for the smallest possible value of $m$ such that if $G$ is a graph on 60
vertices, each vertex having degree at least $m$, then $G$ must contain $K_4$ as a
subgraph.

It is not hard to see that $m$ must be greater than 40. For if we take $G$ to be
the graph $K_{20,20,20}$, with vertex set $V = V_1 \cup V_2 \cup V_3$, $|V_i| = 20$ for each $i$, all
vertices of $V_i$ being joined to all vertices of $V_j$ whenever $i \ne j$, then each vertex
will have degree 40; but no $K_4$ exists in $G$ since any $K_4$ in $G$ would have to
have two vertices in the same $V_i$. We now show that $m = 41$. Let $G$ be any
graph on 60 vertices, each of degree $> 40$. Choose any vertex $v_1$, and let $S_1$ be
the set of vertices of $G$ adjacent to $v_1$. Then $|S_1| > 40$. Take any vertex $v_2$ in $S_1$,
and let $S_2$ be the set of all vertices adjacent to $v_2$. Then
\begin{align*}
|S_1 \cap S_2| &= |S_1| + |S_2| - |S_1 \cup S_2| \\
&> 40 + 40 - |S_1 \cup S_2| \ge 80 - 60 = 20.
\end{align*}
So $|S_1 \cap S_2| > 20$. Next take any $v_3 \in S_1 \cap S_2$, and let $S_3$ be the set of all
vertices of $G$ adjacent to $v_3$. Then
\begin{align*}
|S_1 \cap S_2 \cap S_3| = |(S_1 \cap S_2) \cap S_3| &= |S_1 \cap S_2| + |S_3| - |(S_1 \cap S_2) \cup S_3| \\
&> 20 + 40 - 60 = 0.
\end{align*}
So there is a vertex $v_4$ in $S_1 \cap S_2 \cap S_3$. But then $v_1, v_2, v_3, v_4$ are all
adjacent to one another, and so form a $K_4$ in $G$.

We now extend (6.1) to three sets $A, B, C$ as shown in Figure 6.1. Our first
estimate for $|A \cup B \cup C|$ might be $|A| + |B| + |C|$. But elements in more than
one set will have been included more than once, so a second estimate might
be $|A| + |B| + |C| - |A \cap B| - |B \cap C| - |C \cap A|$. But elements in all three sets
$A, B, C$ will have then been included thrice and excluded thrice, and so have
to be included once more. So finally we get
\[ |A \cup B \cup C| = |A| + |B| + |C| - |A \cap B| - |B \cap C| - |C \cap A| + |A \cap B \cap C|. \tag{6.2} \]
Note that we include, then exclude, then include again.

[Figure 6.1: three overlapping sets $A$, $B$, $C$]

The general formulation is as follows.


Theorem 6.1 (The inclusion-exclusion principle) 


Let $S$ be a set of objects, and let $P_1,\ldots,P_r$ be properties which the elements
of $S$ may or may not possess. Let $N(i,j,\ldots,k)$ denote the number of elements
of $S$ which possess properties $P_i, P_j, \ldots, P_k$ (and possibly some others as well).
Then the number of elements of $S$ possessing at least one of the properties is
\[ \sum_{i} N(i) - \sum_{i<j} N(i,j) + \sum_{i<j<k} N(i,j,k) - \cdots + (-1)^{r-1} N(1,2,\ldots,r). \tag{6.3} \]

Proof
Any element of $S$ which has none of the properties contributes 0 to each term
of (6.3) and hence contributes 0 to the sum.

Now consider an element of $S$ which possesses $t \ge 1$ of the properties, where
$t \le r$. It contributes to $t$ of the terms $N(i)$, to $\binom{t}{2}$ of the terms $N(i,j)$, and so
on. So its total contribution to (6.3) is
\begin{align*}
\binom{t}{1} - \binom{t}{2} + \binom{t}{3} - \cdots + (-1)^{t-1}\binom{t}{t}
&= 1 - \left(1 - \binom{t}{1} + \binom{t}{2} - \cdots + (-1)^{t}\binom{t}{t}\right) \\
&= 1 - (1-1)^t = 1.
\end{align*}


Example 6.3 


Find the number of positive integers $\le 100$ which are divisible by 3 or 7.


Solution


Take $S = \{1,\ldots,100\}$, let $P_1$ be the property of being divisible by 3 and let
$P_2$ be the property of being divisible by 7. Then $N(1)$ is the number of integers
$\le 100$ which are divisible by 3, so $N(1) = 33$. Similarly $N(2) = 14$. Finally
$N(1,2)$ is the number of such integers divisible by both 3 and 7, i.e. divisible
by 21; so $N(1,2) = 4$. Thus the required number is $33 + 14 - 4 = 43$.


The principle is in fact more often used in a slightly different form. Instead of
asking how many elements have at least one of the properties, we ask how
many have none of the properties.


Theorem 6.2 (Second version of the inclusion-exclusion principle)


With the same notation as in Theorem 6.1, the number of elements of $S$
possessing none of the properties is
\[ |S| - \sum_{i} N(i) + \sum_{i<j} N(i,j) - \cdots + (-1)^{r} N(1,2,\ldots,r). \tag{6.4} \]


Example 6.4 


How many positive integers $\le 100$ are divisible by none of 2, 3, 5, 7?


Solution


Here again $S = \{1,\ldots,100\}$ and $P_1,\ldots,P_4$ are the properties of being divisible
by 2, 3, 5, 7 respectively. Then $N(1) = 50$, $N(2) = 33$, $N(3) = 20$ and $N(4) = 14$.
Next, $N(1,2)$ is the number of elements of $S$ divisible by 2 and 3, i.e. by
6, so $N(1,2) = 16$. Similarly, for example, $N(1,3,4)$ is the number of positive
integers $\le 100$ divisible by 2, 5 and 7, i.e. by 70, so that $N(1,3,4) = 1$. Thus
the required number is
\[ 100 - (50+33+20+14) + (16+10+7+6+4+2) - (3+2+1) = 100 - 117 + 45 - 6 = 22. \]

What is the significance of this result? Observe that any number $\le 100$ which
is not prime must possess a factor $\le \sqrt{100} = 10$, and so must be divisible by
a prime $\le 10$, i.e. by 2, 3, 5 or 7. The answer 22 just obtained is not quite the
number of primes $\le 100$ since it does not include the primes 2, 3, 5, 7 themselves,
and it also includes 1 which is not prime. So the number of primes $\le 100$ is
$22 + 4 - 1 = 25$. Check it!
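Both divisibility examples can be reproduced by summing over all subsets of the chosen primes, exactly as (6.4) prescribes. This is a sketch with a function name of our own; it relies on the primes being distinct, so that the number of integers up to the limit divisible by every prime in a subset is the limit divided (rounding down) by their product.

```python
from itertools import combinations
from math import prod


def count_with_none(limit, primes):
    """Count integers in 1..limit divisible by none of the given distinct primes."""
    total = 0
    for r in range(len(primes) + 1):
        for subset in combinations(primes, r):
            # N(subset) = number of multiples of the product; sign is (-1)^r
            total += (-1) ** r * (limit // prod(subset))
    return total


print(count_with_none(100, [2, 3, 5, 7]))   # 22, as in Example 6.4
print(100 - count_with_none(100, [3, 7]))   # 43, as in Example 6.3
```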


Example 6.5 


In one version of the Enigma machine used for encoding secret messages, three 
“rotors” were chosen from a set of five, and were placed in order in the machine. 
On each day a different ordered set of three was chosen, such that no rotor was 
in the same position as on the previous day. Given the arrangement for one 
day, how many possible ordered choices are there for the next day? 


Solution 


Let us suppose that the rotors are labelled $1,\ldots,5$, and that on a given day
rotors 1, 2, 3 are chosen, in that order. Let $S$ denote the set of all possible
ordered sets of three rotors; then $|S| = 5 \times 4 \times 3 = 60$. For each $i \le 3$, let
$P_i$ denote the property that rotor $i$ is in position $i$. Then we want the number
of members of $S$ possessing none of the properties $P_1, P_2, P_3$. So the required
number is
\[ |S| - \sum_{i} N(i) + \sum_{i<j} N(i,j) - N(1,2,3). \]
Now $N(i) = 4 \times 3$ for each $i \le 3$, and each $N(i,j) = 3$; so the answer is
\[ 60 - 3 \times (4 \times 3) + 3 \times 3 - 1 = 32. \]

(Later on the number of rotors used by the German navy was increased to 8; 
see Exercise 6.9.) 
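The answer 32 to Example 6.5 is small enough to confirm by listing every ordered choice of three rotors from five. The variable names below are ours.

```python
from itertools import permutations

previous = (1, 2, 3)  # rotors used yesterday, in positions 1, 2, 3

# keep the ordered triples in which no rotor repeats its previous position
valid = [choice for choice in permutations(range(1, 6), 3)
         if all(choice[i] != previous[i] for i in range(3))]

print(len(valid))  # 32
```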
The next example deals with derangements, already met in Chapter 2. 


Example 6.6 


Recall that a derangement of $n$ objects is a permutation of them with the
property that no object is in its original place. We now show how to use the
inclusion-exclusion principle to derive the formula (2.10) for $d_n$, the number of
derangements of $n$ objects.

Take $S$ to be the set of all permutations of $1,\ldots,n$ and, for each $i \le n$, let
$P_i$ be the property: $i$ is in the $i$th position. Then $d_n$ is the number of members
of $S$ possessing none of the properties $P_i$. Now, for each $i$, $N(i) = (n-1)!$ since
$i$ is fixed and the remaining $n-1$ numbers can be permuted in any way we
like. Similarly, $N(i,j) = (n-2)!$, and so on. Note also that there are $\binom{n}{1}$ terms
$N(i)$, $\binom{n}{2}$ terms $N(i,j)$, etc., so that, by (6.4),
\begin{align*}
d_n &= |S| - \sum_{i} N(i) + \sum_{i<j} N(i,j) - \cdots + (-1)^n N(1,\ldots,n) \\
&= n! - \binom{n}{1}(n-1)! + \binom{n}{2}(n-2)! - \cdots + (-1)^n\binom{n}{n}0! \\
&= n! - \frac{n!}{1!} + \frac{n!}{2!} - \cdots + (-1)^n\frac{n!}{n!} \\
&= n!\left(1 - \frac{1}{1!} + \frac{1}{2!} - \cdots + \frac{(-1)^n}{n!}\right).
\end{align*}
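As a check on the derangement formula, the inclusion-exclusion sum can be compared with a direct count over all permutations for small $n$. The function names are ours, and the brute-force version is only practical for small values.

```python
from itertools import permutations
from math import comb, factorial


def derangements(n):
    """d_n from the inclusion-exclusion sum: sum_k (-1)^k C(n, k) (n - k)!"""
    return sum((-1) ** k * comb(n, k) * factorial(n - k) for k in range(n + 1))


def derangements_brute(n):
    """Count permutations of 0..n-1 with no fixed point by listing them all."""
    return sum(all(p[i] != i for i in range(n)) for p in permutations(range(n)))


print([derangements(n) for n in range(1, 8)])
# [0, 1, 2, 9, 44, 265, 1854]
```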


The next example introduces ideas which will be useful in section 6.4. 


Example 6.7 


How many non-negative integer solutions are there of the equation $x+y+z = 20$
satisfying the conditions $x \le 10$, $y \le 5$, $z \le 15$?


Solution


Let $S$ denote the set of all non-negative solutions of $x+y+z = 20$. By Theorem
1.11, $|S| = \binom{22}{20} = \binom{22}{2}$. Let $P_1$ be the property $x \ge 11$; let $P_2$ be the property
$y \ge 6$, and $P_3$ the property $z \ge 16$. Then we want the number of members of
$S$ possessing none of the properties $P_i$. This number is
\[ \binom{22}{2} - \sum_{i} N(i) + \sum_{i<j} N(i,j) - N(1,2,3). \tag{6.5} \]

Consider $N(1)$. If $x \ge 11$, $x = 11 + u$ for some $u \ge 0$, and the equation becomes
$u+y+z = 9$. The number of non-negative integer solutions of this equation
is $N(1) = \binom{11}{9} = \binom{11}{2}$. Similarly, $N(2) = \binom{16}{2}$, $N(3) = \binom{6}{2}$. Next, $N(1,2)$ is the
number of solutions satisfying $x \ge 11$ and $y \ge 6$; putting $x = 11+u$, $y = 6+v$
changes the equation into $u+v+z = 3$, and the number of non-negative
solutions is $N(1,2) = \binom{5}{2}$. Since $N(1,3)$, $N(2,3)$ and $N(1,2,3)$ are all clearly
zero, we find that (6.5) becomes
\[ \binom{22}{2} - \binom{11}{2} - \binom{16}{2} - \binom{6}{2} + \binom{5}{2} = 51. \]
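The count in Example 6.7 can be confirmed by running through all admissible pairs $(x, y)$ and checking that the forced value of $z$ lies in range. The variable names are ours.

```python
from math import comb

# direct count: x in 0..10, y in 0..5, and z = 20 - x - y must lie in 0..15
brute = sum(1 for x in range(11) for y in range(6) if 0 <= 20 - x - y <= 15)

# the inclusion-exclusion expression obtained from (6.5)
formula = comb(22, 2) - comb(11, 2) - comb(16, 2) - comb(6, 2) + comb(5, 2)

print(brute, formula)  # 51 51
```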


6.2 Counting Surjections 


Let $|X| = m$ and $|Y| = n$, and consider functions $f : X \to Y$. The image of $f$
was defined in Section 5.3 to be the set of all elements of $Y$ which arise as $f(x)$
for some $x \in X$. We say that $f$ is surjective if the image of $f$ is the whole of
$Y$. How many surjections $f : X \to Y$ are there?

Let $S$ denote the set of all functions $f : X \to Y$, where $Y = \{y_1,\ldots,y_n\}$, and
let $P_i$ be the property: $y_i$ is not in the image of $f$. Then $N(i) = (n-1)^m$, since
each of the $m$ elements of $X$ can be mapped by $f$ to any of the $n-1$ other
elements of $Y$. Similarly, $N(i_1,\ldots,i_k) = (n-k)^m$. So by (6.4), the number of
surjections from $X$ to $Y$, i.e. the number of members of $S$ possessing none of
the properties $P_i$, is
\begin{align*}
|S| - \sum_{i} N(i) &+ \sum_{i<j} N(i,j) - \cdots + (-1)^n N(1,\ldots,n) \\
&= n^m - \binom{n}{1}(n-1)^m + \binom{n}{2}(n-2)^m - \cdots + (-1)^n\binom{n}{n}(n-n)^m \\
&= \sum_{i=0}^{n} (-1)^i\binom{n}{i}(n-i)^m.
\end{align*}
One immediate consequence of this result arises from the observation that, if
$n > m$, there are no surjections $f : X \to Y$ since there are not enough elements
in $X$ to be mapped onto the $n$ elements of $Y$. So, since the term with $i = n$ is
zero, we obtain:
\[ \sum_{i=0}^{n-1} (-1)^i\binom{n}{i}(n-i)^m = 0 \quad \text{whenever } m < n. \tag{6.6} \]

Another consequence arises from the fact that the number of surjections from
$X$ to $Y$ is $n!\,S(m,n)$ (as in Theorem 5.7); so we obtain another proof of (5.5).
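The surjection count and the vanishing sum (6.6) can both be tested against a direct enumeration of functions. In this sketch a function from an $m$-set to an $n$-set is represented as a tuple of $m$ values from `range(n)`; the function names are ours.

```python
from itertools import product
from math import comb


def surjections(m, n):
    """Number of surjections from an m-set onto an n-set, by inclusion-exclusion."""
    return sum((-1) ** i * comb(n, i) * (n - i) ** m for i in range(n + 1))


def surjections_brute(m, n):
    """Enumerate all n^m functions and keep those whose image is everything."""
    return sum(1 for f in product(range(n), repeat=m) if len(set(f)) == n)


print(surjections(5, 3))  # 150, which is 3! * S(5, 3) = 6 * 25
print(surjections(3, 5))  # 0, an instance of (6.6)
```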


6.3 Counting Labelled Trees 


It was mentioned in Section 3.3 that Cayley proved that the number of labelled
trees on $n$ vertices is $n^{n-2}$. The three trees on 3 points were shown in Figure
3.7. Many proofs of Cayley’s result have been given. Most of these proofs are
quite tricky; the most common one, due to Prüfer, depends upon constructing
a one-to-one correspondence between the trees and ordered $(n-2)$-tuples of
numbers each of which can be any of $1,\ldots,n$. However, we are here going to
use ideas of J. Moon and employ the inclusion-exclusion principle and (6.6).

Let $S$ denote the set of all spanning trees on the vertices labelled $1,\ldots,n$,
and let $|S| = T(n)$. For each $i \le n$, let $P_i$ be the property: vertex $i$ is an end
vertex. Since, by Theorem 3.3, every tree with $p \ge 2$ vertices has an end vertex,
every member of $S$ must possess at least one of the properties $P_i$. Further, if
$n \ge 3$, no tree on $n$ vertices can have every vertex an end vertex, so no member
of $S$ can possess all $n$ properties. So, by (6.3), for $n \ge 3$,
\[ T(n) = \sum_{i} N(i) - \sum_{i<j} N(i,j) + \cdots + (-1)^n \sum_{i_1<\cdots<i_{n-1}} N(i_1,\ldots,i_{n-1}). \]

Now $N(i) = (n-1)T(n-1)$, since if vertex $i$ is an end vertex, its edge can go
to any of the other $n-1$ vertices, and these $n-1$ vertices are joined by a tree.

[Figure 6.2: end vertices 1 and 2 attached to a labelled tree on $n-2$ vertices; $N(1,2) = (n-2)^2 T(n-2)$]

Similarly, as illustrated in Figure 6.2, $N(i,j) = (n-2)^2 T(n-2)$, and so on.
So, for $n \ge 3$,
\begin{align*}
T(n) &= \binom{n}{1}(n-1)T(n-1) - \binom{n}{2}(n-2)^2 T(n-2) + \cdots + (-1)^n\binom{n}{n-1}T(1) \\
&= \sum_{i=1}^{n-1} (-1)^{i-1}\binom{n}{i}(n-i)^i\, T(n-i). \tag{6.7}
\end{align*}
However, putting $m = n-2$ in (6.6) and rearranging slightly, we get
\[ n^{n-2} = \sum_{i=1}^{n-1} (-1)^{i-1}\binom{n}{i}(n-i)^i (n-i)^{n-i-2}. \tag{6.8} \]
Compare (6.7) and (6.8). If the formula $T(k) = k^{k-2}$ is known to be true for
all $k$ up to $n-1$, then the right-hand sides of (6.7) and (6.8) coincide, and it
therefore follows that $T(n) = n^{n-2}$. Thus, since the result is true for $n = 3$, it
follows by induction that $T(n) = n^{n-2}$ for all $n \ge 3$.
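Recurrence (6.7) can be run forwards to see Cayley's formula emerge. The sketch below assumes the starting values $T(1) = T(2) = 1$ (a single vertex, and a single edge), which the recurrence needs as seeds; the function name is ours.

```python
from math import comb


def labelled_trees(n_max):
    """Return {n: T(n)} for n = 1..n_max, using recurrence (6.7) for n >= 3."""
    T = {1: 1, 2: 1}  # assumed seeds: one tree on 1 vertex, one on 2 vertices
    for n in range(3, n_max + 1):
        T[n] = sum((-1) ** (i - 1) * comb(n, i) * (n - i) ** i * T[n - i]
                   for i in range(1, n))
    return T


T = labelled_trees(8)
print([T[n] for n in range(1, 9)])
# [1, 1, 3, 16, 125, 1296, 16807, 262144]
```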


6.4 Scrabble 


Scrabble is a word game in which players take turns to use the letters in their 
possession to form new words. At each stage of the game, each player has 7 
tiles, each tile having on it a letter or being blank. The distribution of letters 
is as follows. 


Letter: A  B  C  D  E   F  G  H  I  J  K  L  M
Tiles:  9  2  2  4  12  2  3  2  9  1  1  4  2


Letter: N  O  P  Q  R  S  T  U  V  W  X  Y  Z  blank
Tiles:  6  8  2  1  6  4  6  4  2  2  1  2  1  2


At the start of the game, each player chooses 7 tiles. How many ways are there 
of choosing 7 tiles? 

If we let a denote the number of As, 6 the number of Bs, ..., z the number 
of Zs, and w the number of blanks, then the number of possible choices of 7 
tiles is just the number of solutions of the equation 


at+b+...¢z4+w=7 (6.9) 


in non-negative integers, with a < 9,b < 2, and so on. So, to apply the inclusion- 
exclusion principle, take S to be the set of all non-negative integer solutions 
of (6.9), take P, to be the property that a > 10, P, to be the property that 
6 > 3, and so on: then we want the number of members of S with none of the 
properties, as given by (6.4). We proceed as in Example 6.7. 

By Theorem 1.11 , |S| = (77*77') = (7). 

Next consider N(a), the number of solutions of (6.9) with a > 10. This is 
clearly 0. 


Now consider $N(b)$. This is the number of solutions of (6.9) with $b \ge 3$. If we
put $b = b' + 3$, (6.9) becomes $a + (b' + 3) + \dots + z + \omega = 7$, i.e.
$a + b' + \dots + \omega = 4$, and so $N(b) = \binom{27+4-1}{4} = \binom{30}{4}$.

In the same way, we obtain

\[ N(b) = N(c) = N(f) = N(h) = N(m) = N(p) = N(v) = N(w) = N(y) = N(\omega) = \binom{30}{4}, \]
\[ N(j) = N(k) = N(q) = N(x) = N(z) = \binom{31}{5}, \]
\[ N(g) = \binom{29}{3}, \qquad N(d) = N(l) = N(s) = N(u) = \binom{28}{2}, \]
\[ N(n) = N(r) = N(t) = \binom{26}{0} = 1, \]

and all others are zero.


We next have to deal with terms such as $N(c,d)$. There are $\binom{27}{2}$ such
terms, but fortunately many are zero; for example $N(c,d)$ is zero since the
requirements $c \ge 3$ and $d \ge 5$ are incompatible with (6.9).

There are $\binom{10}{2}$ terms equal to $N(b,c)$. Putting $b = b' + 3$ and
$c = c' + 3$ leads to the equation $a + b' + c' + d + \dots + \omega = 1$, which has
$\binom{27}{1}$ solutions. Similarly there are 5 terms equal to $N(g,j)$. Putting
$g = g' + 4$ and $j = j' + 2$ leads to the equation
$a + \dots + g' + h + i + j' + \dots + \omega = 1$, again with $\binom{27}{1}$
solutions. There are $\binom{5}{2} = 10$ terms equal to $N(j,k)$, $50 = 10 \times 5$
terms equal to $N(b,j)$, 10 like $N(b,g)$, and 20 like $N(d,j)$.

Finally, there are $\binom{5}{3}$ terms equal to $N(j,k,q)$, and
$\binom{5}{2} \times 10$ terms equal to $N(b,j,k)$. So the required number is


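\[
\binom{33}{7}
- \left\{ 5\binom{31}{5} + 10\binom{30}{4} + \binom{29}{3} + 4\binom{28}{2} + 3\binom{26}{0} \right\}
+ \left\{ \binom{10}{2}\binom{27}{1} + 50\binom{28}{2} + \binom{5}{2}\binom{29}{3} + 5\binom{27}{1} + 30\binom{26}{0} \right\}
- \left\{ \binom{5}{3}\binom{27}{1} + 100\binom{26}{0} \right\}
\]

= 3199724.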
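
The count can be checked numerically. The following is a minimal sketch, not part of the original text: the names `TILE_COUNTS`, `count_hands` and `inclusion_exclusion_total` are hypothetical, and the first function counts the bounded solutions of (6.9) directly by building up the selections one tile type at a time, rather than by inclusion-exclusion.

```python
from math import comb

# Tile counts from the distribution table: A..Z followed by the blank.
TILE_COUNTS = [9, 2, 2, 4, 12, 2, 3, 2, 9, 1, 1, 4, 2,
               6, 8, 2, 1, 6, 4, 6, 4, 2, 2, 1, 2, 1, 2]


def count_hands(counts, hand_size):
    """Count selections of hand_size tiles using at most counts[i] tiles of type i."""
    ways = [1] + [0] * hand_size  # ways[s] = number of selections of total size s
    for limit in counts:
        new_ways = [0] * (hand_size + 1)
        for size in range(hand_size + 1):
            for take in range(min(limit, hand_size - size) + 1):
                new_ways[size + take] += ways[size]
        ways = new_ways
    return ways[hand_size]


def inclusion_exclusion_total():
    """Evaluate the inclusion-exclusion sum term by term."""
    singles = (5 * comb(31, 5) + 10 * comb(30, 4) + comb(29, 3)
               + 4 * comb(28, 2) + 3 * comb(26, 0))
    pairs = (comb(10, 2) * comb(27, 1) + 50 * comb(28, 2)
             + comb(5, 2) * comb(29, 3) + 5 * comb(27, 1) + 30 * comb(26, 0))
    triples = comb(5, 3) * comb(27, 1) + 100 * comb(26, 0)
    return comb(33, 7) - singles + pairs - triples


print(count_hands(TILE_COUNTS, 7), inclusion_exclusion_total())
```

Both routes should give the same number, which is a useful safeguard against a miscounted coefficient in the inclusion-exclusion sum.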


6.5 The Ménage Problem 


The ménage problem was posed by the French mathematician E. Lucas in 1891. 


Problem. In how many ways can n married couples sit round a table (with 
labelled seats) so that men and women alternate, and no husband and wife sit 
next to each other?

Solution

Let us be gentlemanly, and seat the ladies first. Observe that the wives
$W_1, \dots, W_n$ will occupy either the odd numbered seats or the even numbered
seats; so the number of ways of seating the wives is twice $n!$. Then for each
of these $2(n!)$ arrangements, there will be the same number $g(n)$ of ways of
seating the husbands. The problem is now to evaluate $g(n)$. Without loss of
generality, we can suppose that the wives are seated as shown in Figure 6.3,

Figure 6.3

with the vacant seats now labelled $1, \dots, n$, seat $i$ being between wives
$W_i$ and $W_{i+1}$ ($i < n$), and seat $n$ being between $W_n$ and $W_1$.

Let $S$ be the set of all possible ways of arranging the husbands in the $n$ seats,
so that $|S| = n!$. Then consider the following properties that elements of $S$ may
or may not possess:

$P_i$: $H_i$ is in seat $i$ ($1 \le i \le n$);
$Q_i$: $H_i$ is in seat $i - 1$ ($2 \le i \le n$);
$Q_1$: $H_1$ is in seat $n$.


Then $g(n)$ is just the number of elements of $S$ possessing none of these
properties. In applying (6.4), not every possible combination of properties is
available; for example, $P_1$ and $Q_2$ cannot both be satisfied together.
Properties which can be satisfied simultaneously are called compatible. Let
$r_k$ denote the number of ways of choosing $k$ compatible properties from the
$P_i$ and the $Q_i$. Then, for each such choice, the number of arrangements of
the husbands into seats so that these $k$ properties are satisfied will be
$(n - k)!$. Thus by (6.4),

\[ g(n) = n! - r_1 (n-1)! + r_2 (n-2)! - \dots + (-1)^n r_n \, 0!. \]


To find $r_k$, imagine the properties to be arranged in a circle as shown in
Figure 6.4.

Figure 6.4 (the properties arranged round a circle in the order
$P_1, Q_2, P_2, Q_3, \dots, P_n, Q_1$)

Properties are then compatible precisely when no two are adjacent; so $r_k$ is
just the number of ways of choosing $k$ elements from a circular arrangement
of $2n$ elements, no two of the chosen $k$ being adjacent. Now it was shown in
Exercise 2.15(b) that the number of ways of choosing $k$ non-adjacent elements
from $1, \dots, 2n$ is $\binom{2n-k+1}{k}$; so $r_k$ is this number minus the
number of such choices which contain both 1 and $2n$. But if 1 and $2n$ are
chosen then 2 and $2n - 1$ are not, and so $k - 2$ non-adjacent numbers are
chosen from $3, \dots, 2n - 2$. Since there are $2n - 4 = 2(n - 2)$ numbers here,
there are $\binom{(2n-4)-(k-2)+1}{k-2} = \binom{2n-k-1}{k-2}$ such choices. So
finally

\[ r_k = \binom{2n-k+1}{k} - \binom{2n-k-1}{k-2} = \frac{2n}{2n-k} \binom{2n-k}{k} \]

and the answer $M(n)$ to the ménage problem is $2(n!)g(n)$, where $g(n)$ is

\[ n! - \frac{2n}{2n-1}\binom{2n-1}{1}(n-1)! + \frac{2n}{2n-2}\binom{2n-2}{2}(n-2)! - \dots + (-1)^n \frac{2n}{n}\binom{n}{n} 0!. \]

For example, $M(5) = 2 \times 5! \times 13 = 3120$.
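
As a sanity check on the formula, here is a small sketch (the function names `g_formula`, `g_brute_force` and `menage` are hypothetical) that evaluates g(n) from the alternating sum and compares it with a direct enumeration of the husbands' seatings. Seats and husbands are numbered from 0 in the code, so husband i is barred from seats i and i - 1 (mod n).

```python
from itertools import permutations
from math import comb, factorial


def g_formula(n):
    """g(n) from the sum over k of (-1)^k r_k (n-k)!, with r_k = 2n/(2n-k) * C(2n-k, k)."""
    total = 0
    for k in range(n + 1):
        r_k = 2 * n * comb(2 * n - k, k) // (2 * n - k)  # always an exact division
        total += (-1) ** k * r_k * factorial(n - k)
    return total


def g_brute_force(n):
    """Count seatings in which no husband sits next to his own wife."""
    count = 0
    for seats in permutations(range(n)):
        if all(seats[i] != i and seats[i] != (i - 1) % n for i in range(n)):
            count += 1
    return count


def menage(n):
    """M(n) = 2 (n!) g(n)."""
    return 2 * factorial(n) * g_formula(n)


print([g_formula(n) for n in range(3, 8)], menage(5))
```

The brute-force count is only practical for small n, but it is independent of the inclusion-exclusion argument and so gives a genuine check.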

Looking ahead, there is a nice alternative way of interpreting $g(n)$; it is the
number of ways of choosing a third row of an $n \times n$ latin square whose first
two rows are

    1  2  3  ...  n-1  n
    2  3  4  ...  n    1.


Exercises 


Exercise 6.1 


Extend (6.2) to four sets A, B,C, D. 

Exercise 6.2


Each student in a class of 100 reads at least one of mathematics and 
computing; 67 read mathematics and 44 read both. How many read 
computing? 


Exercise 6.3

Each of the 100 students at a music school plays at least one instrument,
string, woodwind or brass. 70 play a string instrument, 49 woodwind and
49 brass. 20 play both string and woodwind, 25 play both string and brass,
and 35 play both woodwind and brass. How many play all three types
of instrument?


Exercise 6.4 


How many positive integers < 1000 are divisible by none of 7,11 or 13? 


Exercise 6.5 


(a) Imitate the argument of Example 6.2 to show that if $G$ is a graph
with 100 vertices, each of degree > 75, then $G$ must contain $K_5$ as
a subgraph.

(b) Generalise to the following. If $G$ is a graph with $mn$ vertices, each
of degree > $m(n - 1)$, then $G$ must contain $K_{n+1}$ as a subgraph.


Exercise 6.6


Show that the number of solutions of the equation $x + y + z = 100$ in
non-negative integers $x, y, z$ with $x \le 50$, $y \le 40$, $z \le 30$ is 231.


Exercise 6.7 


How many permutations are there of 1,... ,8 in which none of the pat- 
terns 12, 34,56, 78 appears? 


Exercise 6.8 


How many permutations of 1,... ,8 are there in which no even number 
appears in its natural position? 

Exercise 6.9


Repeat Example 6.5 on the Enigma machine, with 5 replaced by 8. 


Exercise 6.10 


How many permutations of 1,1, 2,2,3,3,4,4,5,5 are there in which no 
two adjacent numbers are equal? 


Exercise 6.11 


The Euler phi function. Let $n = p_1^{a_1} \cdots p_r^{a_r}$ be the prime factorisation
of $n$ and let $\phi(n)$ denote the number of positive integers $\le n$ which are
coprime to $n$ (i.e. which are divisible by none of $p_1, \dots, p_r$). For example,
$\phi(10) = 4$, the numbers being 1, 3, 7, 9. Use the inclusion-exclusion
principle to show that $\phi(n) = n \prod_i (1 - \frac{1}{p_i})$. Hence find
$\phi(100)$, $\phi(200)$.


Exercise 6.12 


Let $G$ be a graph with $n$ vertices and $m$ edges. Let $S$ be the set of all
possible colourings of the vertices using $\lambda$ colours, ignoring adjacencies:
so $|S| = \lambda^n$. For each $i \le m$, let $P_i$ denote the property that the
endpoints of edge $e_i$ receive the same colour. Then $f_\lambda(G)$, the number of
vertex colourings of $G$ using $\lambda$ colours (see Exercise 5.19), is the number
of elements of $S$ possessing none of the properties $P_i$. Deduce that

\[ f_\lambda(G) = \lambda^n + a_1 \lambda^{n-1} + a_2 \lambda^{n-2} + \dots + a_{n-1} \lambda + a_n \]

where $a_1 = -m$ and $a_2 = \binom{m}{2} - t$, where $t$ denotes the number of
subgraphs of $G$ isomorphic to $K_3$.


Exercise 6.13 


(An example from Abraham de Moivre’s book Doctrines of Chance, pub- 
lished in 1717.) If 12 throws of a die are made, what is the probability 
that all six numbers appear? 


Exercise 6.14 


Use the inclusion-exclusion principle to find the number of partitions of 
{1,... ,10} into four parts, none of which is a singleton set. 


Latin Squares and Hall’s Theorem 


In this chapter we study latin squares and their orthogonality, and construct 
magic squares. We also discuss Hall’s theorem on systems of distinct represen- 
tatives and apply it to latin squares and bipartite graphs. Finally, we show how 
complete sets of orthogonal latin squares lead to affine planes. 


7.1 Latin Squares and Orthogonality 


Definition 7.1 


A latin square of order n is an n x n array in which each row and each column 
contains each of n given symbols exactly once. 
Here, for example, are two latin squares of order 4, based on the set {1, 2, 3, 4}:

           1 2 3 4              1 2 3 4
    L_1 =  3 4 1 2 ,     L_2 =  4 3 2 1 .
           4 3 2 1              2 1 4 3
           2 1 4 3              3 4 1 2


Much interest in latin squares arose through their use in statistical experimental 
design, but they crop up in many different areas of discrete mathematics and 
algebra; a trivial example is the fact that the composition table of a finite group 
is a latin square. 

Example 7.1


Suppose that $2n$ teams play in a league, with each team playing one game on
each of $2n - 1$ consecutive weekends. Altogether each team has to play every
other team once. Label the teams by $1, \dots, 2n$, write $a_{ij} = k$ if teams
$i$ and $j$ play each other on the $k$th weekend ($i \ne j$) and put $a_{ii} = 2n$
for each $i$. Then $A = (a_{ij})$ is a latin square of order $2n$. For example,
the games


    weekend 1:  1 v 2,  3 v 4
    weekend 2:  1 v 3,  2 v 4
    weekend 3:  1 v 4,  2 v 3


give the latin square


    4 1 2 3
    1 4 3 2
    2 3 4 1
    3 2 1 4


Note that this latin square is symmetric about the main diagonal:
$a_{ij} = a_{ji}$ for all $i, j$. Conversely, any symmetric latin square of order
$2n$ with a constant main diagonal can be interpreted as the fixture list for a
league of $2n$ teams.
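
The passage from a fixture list to a latin square in Example 7.1 can be sketched as follows. This is only an illustration: the function names and the representation of a fixture list as a list of weekends, each a list of pairs of team labels, are assumptions made here.

```python
def fixture_latin_square(weekends, teams):
    """Build A = (a_ij): a_ij = k if i and j meet on weekend k, and a_ii = teams."""
    square = [[teams] * teams for _ in range(teams)]  # diagonal entries stay 2n
    for k, games in enumerate(weekends, start=1):
        for i, j in games:
            square[i - 1][j - 1] = k
            square[j - 1][i - 1] = k
    return square


def is_latin_square(square):
    """Each row and each column must contain each of 1..n exactly once."""
    n = len(square)
    symbols = set(range(1, n + 1))
    rows_ok = all(set(row) == symbols for row in square)
    cols_ok = all({square[i][j] for i in range(n)} == symbols for j in range(n))
    return rows_ok and cols_ok


weekends = [[(1, 2), (3, 4)], [(1, 3), (2, 4)], [(1, 4), (2, 3)]]
league = fixture_latin_square(weekends, 4)
for row in league:
    print(row)
```

If the weekends do not form a valid schedule (some team idle or playing twice on a weekend), `is_latin_square` returns False for the resulting array.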

Many applications of latin squares make use of the concept of orthogonality. 
This idea goes back to Euler. In 1779 he discussed a problem concerning 36 
officers. These officers were from six different regiments, six from each; among 
the six from each regiment was one officer of each of six different ranks. Euler 
asked if it were possible to arrange the 36 officers in a 6 x 6 array, so that 
each row and each column contained one officer of each regiment and one of 
each rank. He believed (correctly, although he couldn’t prove it) that such an 
arrangement is impossible. 

Instead, consider the corresponding problem of 16 officers, from four regiments
α, β, γ, δ, with one officer of each of the ranks a, b, c, d from each regiment.
This problem can be solved. Here is a solution; in it, for example, γd
stands for an officer from regiment γ and of rank d:


    αa  βb  γc  δd
    γd  δc  αb  βa
    δb  γa  βd  αc
    βc  αd  δa  γb.


Since there is to be one officer from each regiment in each row and column,
the letters α, β, γ, δ must form a latin square; similarly for a, b, c, d. Further,
since there is just one officer of each rank from each regiment, the pairs (Greek
letter, Latin letter) must all be distinct. This is achieved in the given solution,
and the reader can check that the Greek letters correspond to the latin square
$L_1$, and the Latin letters to the latin square $L_2$.




Definition 7.2 


(i) If $A = (a_{ij})$ and $B = (b_{ij})$ are $n \times n$ arrays, the join $(A, B)$ of
$A$ and $B$ is the $n \times n$ array whose $(i, j)$th entry is the pair
$(a_{ij}, b_{ij})$.


(ii) Latin squares $A$, $B$ are orthogonal if all the entries in their join $(A, B)$
are distinct.


If $A$ and $B$ are orthogonal, we call each of $A$ and $B$ an orthogonal mate
of the other. Thus, for example, $L_1$ and $L_2$ are orthogonal mates. Note that
the requirement that all the entries in $(A, B)$ are distinct is equivalent to the
requirement that each of the $n^2$ possible pairs occurs exactly once. Note also
that the condition for orthogonality of $A$ and $B$ can be expressed as:


if $a_{ij} = a_{IJ}$ and $b_{ij} = b_{IJ}$ then $i = I$ and $j = J$. (7.1)


The join of two latin squares is called a Graeco-Latin square because of 
Euler’s use of Greek and Latin letters, as above. The name latin square for 
a single array was a later introduction. Euler’s officer problem has a solution 
only if there exist two orthogonal latin squares of order 6; eventually, in 1900, 
it was proved beyond doubt that no such squares exist. 

More generally, latin squares $A_1, \dots, A_r$ of order $n$ are mutually
orthogonal if they are orthogonal in pairs, i.e. if, for all $i \ne j$, $A_i$ and
$A_j$ are orthogonal. We shall use the abbreviation MOLS for mutually orthogonal
latin squares.

For a given $n$, there is a limit to the number of MOLS of order $n$ that can
exist. We let $N(n)$ denote the largest value of $r$ for which $r$ MOLS of order $n$
exist.


Theorem 7.1 
$N(n) \le n - 1$ for all $n \ge 2$.


Proof


Suppose $L_1, \dots, L_r$ are $r$ MOLS of order $n$. By relabelling the elements of
each (which does not affect the orthogonality condition) we can suppose that
each square has first row $1, 2, \dots, n$. Concentrate on the entries in the (2,1)
position. Since each square already has a 1 in the first column, none of these
(2,1) entries can be 1. But, further, no two of them can be equal, for the join
of any two of the squares already has each repeated pair in the first row. So
$r \le n - 1$.


A set of $n - 1$ MOLS of order $n$, if it exists, is called a complete set of MOLS.


Example 7.2 
Here is a complete set of 3 MOLS of order 4: 

           1 2 3 4             1 2 3 4             1 2 3 4
    M_1 =  2 1 4 3 ,    M_2 =  4 3 2 1 ,    M_3 =  3 4 1 2 .
           3 4 1 2             2 1 4 3             4 3 2 1
           4 3 2 1             3 4 1 2             2 1 4 3


The next theorem establishes that complete sets exist wherever n is prime. 


Theorem 7.2 
$N(p) = p - 1$ for all primes $p$.


Proof 


We define square arrays $A_1, \dots, A_{p-1}$ as follows. For the $(i,j)$th entry of
$A_k$, take $a^{(k)}_{ij} = ki + j$, reduced modulo $p$ to lie in the set
$\{1, \dots, p\}$.


(i) We first check that each $A_k$ is a latin square. First, the entries in the
$i$th row are all different; for if $a^{(k)}_{ij} = a^{(k)}_{iJ}$ then
$ki + j \equiv ki + J \pmod p$, whence $j = J$. Secondly, the entries in the
$j$th column are all distinct; for if $a^{(k)}_{ij} = a^{(k)}_{Ij}$ then
$ki + j \equiv kI + j \pmod p$, whence $k(i - I) \equiv 0 \pmod p$. Thus $p$
divides $k(i - I)$, whence $p$ divides $i - I$, so that $i \equiv I \pmod p$
and $i$ must equal $I$.


(ii) We now use (7.1) to check that $A_k$ and $A_h$ are orthogonal whenever
$k \ne h$. Suppose $a^{(k)}_{ij} = a^{(k)}_{IJ}$ and $a^{(h)}_{ij} = a^{(h)}_{IJ}$.
Then $ki + j \equiv kI + J$ and $hi + j \equiv hI + J \pmod p$. Subtracting one
from the other gives $(h - k)i \equiv (h - k)I \pmod p$, and, as above, this
gives $i = I$. Substituting back now gives $j \equiv J \pmod p$, whence $j = J$.
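
The construction in this proof is easy to carry out mechanically. The sketch below (function names are hypothetical) builds the arrays with entries ki + j reduced into {1, ..., p} and checks the latin and orthogonality properties directly from Definitions 7.1 and 7.2; it also illustrates that the same recipe can fail when the modulus is not prime.

```python
def mols_from_prime(p):
    """Arrays A_1..A_{p-1} with (i,j)th entry k*i + j reduced mod p into {1..p}."""
    return [[[(k * i + j - 1) % p + 1 for j in range(1, p + 1)]
             for i in range(1, p + 1)]
            for k in range(1, p)]


def is_latin_square(square):
    n = len(square)
    symbols = set(range(1, n + 1))
    rows_ok = all(set(row) == symbols for row in square)
    cols_ok = all({square[i][j] for i in range(n)} == symbols for j in range(n))
    return rows_ok and cols_ok


def are_orthogonal(a, b):
    """All n^2 pairs in the join (A, B) must be distinct."""
    n = len(a)
    pairs = {(a[i][j], b[i][j]) for i in range(n) for j in range(n)}
    return len(pairs) == n * n


squares = mols_from_prime(5)
print(squares[0][0])  # first row of A_1
print(all(is_latin_square(s) for s in squares))
print(all(are_orthogonal(squares[h], squares[k])
          for h in range(4) for k in range(h + 1, 4)))
```

With a composite modulus such as 4, the array for k = 2 repeats rows and so is not a latin square, which is where the primality of p enters the proof.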


Example 7.3
Here is a complete set of 4 MOLS of order 5:

           2 3 4 5 1             3 4 5 1 2
           3 4 5 1 2             5 1 2 3 4
    A_1 =  4 5 1 2 3 ,    A_2 =  2 3 4 5 1 ,
           5 1 2 3 4             4 5 1 2 3
           1 2 3 4 5             1 2 3 4 5

           4 5 1 2 3             5 1 2 3 4
           2 3 4 5 1             4 5 1 2 3
    A_3 =  5 1 2 3 4 ,    A_4 =  3 4 5 1 2 .
           3 4 5 1 2             2 3 4 5 1
           1 2 3 4 5             1 2 3 4 5


It should be noted here, for those readers who know about finite fields, that a
similar type of argument establishes that $N(q) = q - 1$ whenever $q$ is a prime
power. (See Exercise 7.9.)


7.2 Magic Squares 


A magic square of order $n$ is an $n \times n$ array containing each of the numbers
$1, \dots, n^2$, and such that each row, each column and the two main diagonals all
have a common sum (which in fact must be $\frac{1}{2}n(n^2 + 1)$; see Exercise 7.3).


Example 7.4

Here is a magic square of order 3:

    8 1 6
    3 5 7
    4 9 2


Magic squares of all orders $n \ge 3$ exist. If $n$ is odd, the method brought
back from Siam by de la Loubère in the seventeenth century can be used, as
in Example 7.4. Start with 1 in the centre of the top row; in general, travel
northeast, and put the next number in the next square if the square is free; if
you go off one edge, reappear at the opposite edge; if travelling northeast takes
you to an occupied square, go south instead.
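
The rules just described translate directly into a short routine. This is a sketch under the reading that "go south" means moving to the square immediately below the one just filled; the function names are hypothetical.

```python
def de_la_loubere(n):
    """Fill an n x n array (n odd) by the northeast rule with wrap-around."""
    if n % 2 == 0:
        raise ValueError("the method applies to odd n only")
    square = [[0] * n for _ in range(n)]
    row, col = 0, n // 2  # centre of the top row
    for value in range(1, n * n + 1):
        square[row][col] = value
        up, right = (row - 1) % n, (col + 1) % n  # northeast, wrapping at the edges
        if square[up][right] == 0:
            row, col = up, right
        else:
            row = (row + 1) % n  # occupied: go south of the current square instead
    return square


def is_magic(square):
    """Rows, columns and both main diagonals must share the sum n(n^2 + 1)/2."""
    n = len(square)
    target = n * (n * n + 1) // 2
    lines = [list(row) for row in square]
    lines += [[square[i][j] for i in range(n)] for j in range(n)]
    lines.append([square[i][i] for i in range(n)])
    lines.append([square[i][n - 1 - i] for i in range(n)])
    entries = sorted(x for row in square for x in row)
    return entries == list(range(1, n * n + 1)) and all(sum(l) == target for l in lines)


for row in de_la_loubere(3):
    print(row)
```

For n = 3 this reproduces the square of Example 7.4.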


Example 7.5 
Check that de la Loubère's method in the case $n = 5$ gives the following:

    17 24  1  8 15
    23  5  7 14 16
     4  6 13 20 22
    10 12 19 21  3
    11 18 25  2  9

Methods of constructing magic squares of even order are more complicated.
Some squares can be constructed by a general method due to Euler, involving
MOLS.


Example 7.6 


Take the join of the second and third latin squares of Example 7.2:


    11 22 33 44
    43 34 21 12
    24 13 42 31
    32 41 14 23


Reduce the first number in each pair by 1 to obtain


    01 12 23 34
    33 24 11 02          (7.2)
    14 03 32 21
    22 31 04 13


and then interpret the entries as "base 4" representations of the numbers 1 to
16 ($xy$ standing for $4x + y$). This gives the following magic square:


     1  6 11 16
    15 12  5  2
     8  3 14  9
    10 13  4  7


Why this is a magic square is easy to see from (7.2); each row, column and
main diagonal in (7.2) contains each of 0, 1, 2, 3 once in the first position and
each of 1, 2, 3, 4 once in the second position, and therefore they all have equal
sums.

This method will work in general provided we can find two MOLS in which 
both diagonals contain each element exactly once. It should now be clear why 
we did not take the first latin square of Example 7.2. 
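
Euler's method as used in Example 7.6 can be sketched as below. The names `M1`, `M2`, `M3` stand for the three squares of Example 7.2, and `euler_square` and `is_magic` are hypothetical helper names; an entry pair (x, y) of the join becomes n(x - 1) + y.

```python
M1 = [[1, 2, 3, 4], [2, 1, 4, 3], [3, 4, 1, 2], [4, 3, 2, 1]]
M2 = [[1, 2, 3, 4], [4, 3, 2, 1], [2, 1, 4, 3], [3, 4, 1, 2]]
M3 = [[1, 2, 3, 4], [3, 4, 1, 2], [4, 3, 2, 1], [2, 1, 4, 3]]


def euler_square(a, b):
    """Turn the join (A, B) into numbers 1..n^2 via (x, y) -> n(x - 1) + y."""
    n = len(a)
    return [[n * (a[i][j] - 1) + b[i][j] for j in range(n)] for i in range(n)]


def is_magic(square):
    """Rows, columns and both main diagonals must share the sum n(n^2 + 1)/2."""
    n = len(square)
    target = n * (n * n + 1) // 2
    lines = [list(row) for row in square]
    lines += [[square[i][j] for i in range(n)] for j in range(n)]
    lines.append([square[i][i] for i in range(n)])
    lines.append([square[i][n - 1 - i] for i in range(n)])
    return all(sum(line) == target for line in lines)


magic = euler_square(M2, M3)
for row in magic:
    print(row)
print(is_magic(magic), is_magic(euler_square(M1, M2)))
```

The second check fails because the main diagonal of the first square of Example 7.2 consists entirely of 1s, so the diagonal of the resulting array sums to 10 rather than 34.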

The following Indian magic square, dating back to the twelfth century, has 
the extra property that every (broken) diagonal has the same sum: 


     7 12  1 14
     2 13  8 11          (7.3)
    16  3 10  5
     9  6 15  4

For example, 12 + 8 + 5 + 9 = 12 + 2 + 5 + 15 = 34. Such a magic square is
called pandiagonal or diabolic. If we want to use Euler's method to construct
diabolic magic squares, then we have to choose two orthogonal latin squares
such that every broken diagonal has the same sum (or, in particular, contains
all the elements exactly once). This can be achieved in many cases.


Example 7.7 


Take $A_2$ and $A_3$ in Example 7.3. Reducing each element of $A_2$ by 1 and
interpreting the pair $(x, y)$ in the join by $5x + y$, we obtain


    14 20 21  2  8
    22  3  9 15 16
    10 11 17 23  4
    18 24  5  6 12
     1  7 13 19 25


More generally, if $n$ is odd and is not divisible by 3, we can take $A = (a_{ij})$
and $B = (b_{ij})$ where $a_{ij} \equiv 2i + j - 2 \pmod n$,
$b_{ij} \equiv 3i + j - 2 \pmod n$. Then the join of $A$ and $B$ leads to a
diabolic magic square. The details are left to the reader in Exercise 7.6.
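
The general recipe can be tested numerically. In this sketch (hypothetical names throughout) the entries of A and B are taken in {1, ..., n}, the first is reduced by 1, and the pair (x, y) is read as nx + y, following Examples 7.6 and 7.7; the check covers rows, columns and all 2n broken diagonals.

```python
def diabolic_candidate(n):
    """Square from a_ij = 2i + j - 2 and b_ij = 3i + j - 2 (mod n), with 1-based i, j."""
    square = []
    for i in range(1, n + 1):
        row = []
        for j in range(1, n + 1):
            a_minus_1 = (2 * i + j - 3) % n   # a_ij taken in {1..n}, then reduced by 1
            b = (3 * i + j - 3) % n + 1       # b_ij taken in {1..n}
            row.append(n * a_minus_1 + b)
        square.append(row)
    return square


def is_pandiagonal(square):
    """All rows, columns and broken diagonals must sum to n(n^2 + 1)/2."""
    n = len(square)
    target = n * (n * n + 1) // 2
    lines = [list(row) for row in square]
    lines += [[square[i][j] for i in range(n)] for j in range(n)]
    for shift in range(n):
        lines.append([square[i][(i + shift) % n] for i in range(n)])
        lines.append([square[i][(shift - i) % n] for i in range(n)])
    entries = sorted(x for row in square for x in row)
    return entries == list(range(1, n * n + 1)) and all(sum(l) == target for l in lines)


print([n for n in (3, 5, 7, 9, 11) if is_pandiagonal(diabolic_candidate(n))])
```

For n = 3 the second array has constant columns, so the column sums differ and the check fails, in line with the condition that n is not divisible by 3.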


7.3 Systems of Distinct Representatives 


Latin squares can be constructed a row at a time. Given the first row, the second
row has to be a derangement of the first; but, in general, if the first $r$ rows of
a hoped-for $n \times n$ latin square have been constructed, is it always possible to
find a suitable $(r + 1)$th row? Since $r$ elements have so far appeared in each
column, the set $X_i$ of available entries for the $i$th position in the $(r + 1)$th
row has size $n - r$. The problem is: can a different element be chosen from each
of $X_1, \dots, X_n$? If so, then these elements will form an $(r + 1)$th row.


Example 7.8 
Suppose we have the first 2 rows as follows: 


    1 2 3 4 5
    3 1 4 5 2

Here $X_1 = \{2, 4, 5\}$, $X_2 = \{3, 4, 5\}$, $X_3 = \{1, 2, 5\}$, $X_4 = \{1, 2, 3\}$,
$X_5 = \{1, 3, 4\}$. We could choose 2, 3, 5, 1, 4 from $X_1, \dots, X_5$ respectively
to get


    1 2 3 4 5
    3 1 4 5 2
    2 3 5 1 4


A system of distinct representatives (SDR) for the sets $A_1, \dots, A_m$
consists of distinct elements $x_1, \dots, x_m$ such that $x_i \in A_i$ for each $i$.


Example 7.9 


(a) 3,1,4,2 form an SDR for the sets {1,3, 5}, {1, 2}, {3, 4}, {2, 3, 4}. 
(b) The sets {1, 2, 4}, {2, 4}, {1, 4}, {1, 2}, {1, 5}, {3, 4, 5} do not possess an
SDR since the first four of the sets contain only three elements in their
union, not enough to provide distinct representatives of each.

This example illustrates the only situation which can stop a collection of sets
having an SDR. We shall say that sets $A_1, \dots, A_n$ satisfy the Hall condition
if


for all $k \le n$, the union of any $k$ of the sets
$A_i$ contains at least $k$ elements. (7.4)


Theorem 7.3 


The sets $A_1, \dots, A_n$ possess an SDR if and only if they satisfy the Hall
condition (7.4).


Proof 


[We give a proof based on ideas of R. Rado. It uses the simplest form of the
inclusion-exclusion principle: $|X \cup Y| = |X| + |Y| - |X \cap Y|$.]

If the sets possess an SDR they clearly must satisfy (7.4). So now suppose
that $A_1, \dots, A_n$ satisfy (7.4); we show they possess an SDR. We begin by
removing, if possible, an element from one of the sets $A_i$ so that the resulting
sets still satisfy the Hall condition. We then continue to remove elements in
this way, one at a time, until we obtain sets $B_1, \dots, B_n$, with
$B_i \subseteq A_i$ for each $i$, such that the removal of any further element from
any $B_i$ would cause the Hall condition for the $B_i$ to be violated. If we can
show that each $B_i$ is a singleton set (i.e. $|B_i| = 1$ for each $i$), then the
sets $B_i$ themselves must be disjoint and hence provide an SDR for the $A_i$.

So suppose $B_1$ has two elements $x, y$. If either is removed, Hall's condition
for the $B_i$ is violated, so there are two sets $P, Q$ of indices such that if

\[ X = (B_1 - \{x\}) \cup \bigcup_{i \in P} B_i, \qquad Y = (B_1 - \{y\}) \cup \bigcup_{i \in Q} B_i \]

then $|X| \le |P|$ and $|Y| \le |Q|$. But

\[ X \cup Y = B_1 \cup \bigcup_{i \in P \cup Q} B_i, \qquad X \cap Y \supseteq \bigcup_{i \in P \cap Q} B_i, \]

and the Hall condition gives

\[ |X \cup Y| \ge 1 + |P \cup Q|, \qquad |X \cap Y| \ge |P \cap Q|. \]

Thus the inclusion-exclusion principle gives

\[ |P| + |Q| \ge |X| + |Y| = |X \cup Y| + |X \cap Y| \ge 1 + |P \cup Q| + |P \cap Q| = 1 + |P| + |Q|, \]

a contradiction. So $B_1$ must have just one element. A similar argument can be
applied to each $B_i$, so $|B_i| = 1$ for all $i$, as required.
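
For small families of sets, both sides of Theorem 7.3 can be checked by machine. The sketch below is not Rado's argument: `find_sdr` uses the standard augmenting-path method for bipartite matching (swapped in here because it is short to code), and `satisfies_hall` tests condition (7.4) by brute force over all subfamilies. All names are hypothetical.

```python
from itertools import combinations


def find_sdr(sets):
    """Return a list of distinct representatives, one per set, or None if none exists."""
    owner = {}  # element -> index of the set it currently represents

    def try_assign(i, seen):
        for x in sorted(sets[i]):
            if x in seen:
                continue
            seen.add(x)
            # Use x if it is free, or if its current set can be given another element.
            if x not in owner or try_assign(owner[x], seen):
                owner[x] = i
                return True
        return False

    for i in range(len(sets)):
        if not try_assign(i, set()):
            return None
    representatives = [None] * len(sets)
    for x, i in owner.items():
        representatives[i] = x
    return representatives


def satisfies_hall(sets):
    """Check that every k of the sets have at least k elements in their union."""
    for k in range(1, len(sets) + 1):
        for chosen in combinations(sets, k):
            if len(set().union(*chosen)) < k:
                return False
    return True


example_a = [{1, 3, 5}, {1, 2}, {3, 4}, {2, 3, 4}]
example_b = [{1, 2, 4}, {2, 4}, {1, 4}, {1, 2}, {1, 5}, {3, 4, 5}]
print(find_sdr(example_a), satisfies_hall(example_a))
print(find_sdr(example_b), satisfies_hall(example_b))
```

The two families are those of Example 7.9; the SDR returned for the first need not be the one quoted there, since SDRs are not unique.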


We now establish an important consequence of Theorem 7.3 (or Hall’s 
theorem, as it is called), which will be particularly useful. 


Theorem 7.4 


Let $A_1, \dots, A_n$ be subsets of $S$ such that, for some $m$,
(i) $|A_i| = m$ for each $i$, and

(ii) each element of $S$ occurs in exactly $m$ of the $A_i$.
Then $A_1, \dots, A_n$ possess an SDR.


Proof


We show that the sets $A_i$ satisfy (7.4). Consider the union of $k$ of the $A_i$.
Including repetitions, the union contains $km$ elements. But, by (ii), no element
can occur in this union more than $m$ times; so the number of distinct elements
in the union is at least $km/m = k$.


The first use of Theorem 7.4 is to confirm that latin squares can be built up 
row by row. 

Definition 7.4


If $r \le n$, an $r \times n$ latin rectangle on an $n$-element set $S$ is an
$r \times n$ array of elements of $S$ such that no element occurs more than once in
any row or column.


Theorem 7.5 


Any r x n latin rectangle with r < n can be extended to an (r + 1) x n latin 
rectangle. 


Proof 
Let $L$ be an $r \times n$ latin rectangle on $\{1, \dots, n\}$. For each $i \le n$
let $A_i$ denote the set of elements of $\{1, \dots, n\}$ which do not occur in the
$i$th column of $L$.

Then $|A_i| = n - r$ for each $i$. Further, for each $j \le n$, $j$ occurs in each
row of $L$ and hence has appeared in $r$ of the columns; so $j$ must occur in
precisely $n - r$ of the $A_i$. Thus we can take $m = n - r$ in Theorem 7.4 and
conclude that the sets $A_i$ possess an SDR which can be taken as the $(r + 1)$th
row of the required rectangle.
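
Theorem 7.5 only asserts that an extra row exists; to actually find one, an SDR of the column sets has to be computed. The sketch below does this with the standard augmenting-path matching method, which is an implementation choice made here and not part of the proof above. Function names are hypothetical.

```python
def extend_latin_rectangle(rect, n):
    """Return rect with one extra row, taken as an SDR of the column sets A_i."""
    symbols = set(range(1, n + 1))
    available = [symbols - {row[c] for row in rect} for c in range(n)]  # the sets A_i
    column_of = {}  # symbol -> column it currently represents

    def try_assign(c, seen):
        for s in sorted(available[c]):
            if s in seen:
                continue
            seen.add(s)
            if s not in column_of or try_assign(column_of[s], seen):
                column_of[s] = c
                return True
        return False

    for c in range(n):
        if not try_assign(c, set()):
            raise ValueError("no SDR found: the input is not a latin rectangle")
    new_row = [0] * n
    for s, c in column_of.items():
        new_row[c] = s
    return rect + [new_row]


def complete_latin_square(rect, n):
    """Apply the extension step repeatedly until there are n rows."""
    while len(rect) < n:
        rect = extend_latin_rectangle(rect, n)
    return rect


def is_latin_square(square):
    n = len(square)
    symbols = set(range(1, n + 1))
    rows_ok = all(set(row) == symbols for row in square)
    cols_ok = all({square[i][j] for i in range(n)} == symbols for j in range(n))
    return rows_ok and cols_ok


square = complete_latin_square([[1, 2, 3, 4, 5], [3, 1, 4, 5, 2]], 5)
for row in square:
    print(row)
```

Starting from the two rows of Example 7.8, the routine produces some completion to a latin square of order 5; which completion it finds depends on the order in which candidates are tried.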


Our second application of Theorem 7.4 is really just a reformulation of the 
result in terms of bipartite graphs. 


Definition 7.5


A set of disjoint edges in a graph $G$ is called a matching. If $G$ has $2n$ vertices,
a matching with $n$ edges is called a complete or perfect matching of $G$.


Theorem 7.6
Let $G$ be a bipartite graph with vertex set bipartition $V = B \cup W$ where
$|B| = |W| = n$, and where every vertex of $G$ has the same vertex degree $m$.
Then $G$ possesses a perfect matching.


Proof
Suppose that $B = \{u_1, \dots, u_n\}$ and $W = \{v_1, \dots, v_n\}$. For each
$i \le n$ let

$A_i = \{ j : u_i \text{ and } v_j \text{ are adjacent} \}$.

Then the sets $A_i$ satisfy the conditions of Theorem 7.4, and so the $A_i$ possess
an SDR. The SDR gives a perfect matching in which $u_i$ is adjacent to $v_j$ where
$j$ represents $A_i$.


For regular bipartite graphs such as those in Theorem 7.6 we immediately
obtain an alternative proof of König's result that $\chi'(G) = \Delta$ for bipartite
graphs. For we can first find a perfect matching and colour its edges with
one colour. The graph obtained by removing this matching then satisfies the
conditions of the theorem with $m = \Delta - 1$, so we can repeat the argument and
obtain a second perfect matching which we then colour with another colour.
Continuing in this way we partition the edge set into $\Delta$ perfect matchings and
thus obtain an edge colouring using $\Delta$ colours.

Theorem 7.6 was in fact proved first by Steinitz in 1893 and then independently
by König in 1914. It is, of course, a special case of Theorem 5.13, since
the edges of any one colour form a matching.



7.4 From Latin Squares to Affine Planes 
In this section we show how complete sets of MOLS give rise to an important 
family of designs. 

We start with three MOLS of order four, as given in Example 7.2, along with
the following “natural” array $N$:


           1  2  3  4
    N =    5  6  7  8
           9 10 11 12
          13 14 15 16


From them, we construct 20 4-element sets as follows.
(a) The rows of $N$ give four sets:

    {1, 2, 3, 4}, {5, 6, 7, 8}, {9, 10, 11, 12}, {13, 14, 15, 16},

and the columns of $N$ give another four sets:

    {1, 5, 9, 13}, {2, 6, 10, 14}, {3, 7, 11, 15}, {4, 8, 12, 16}.

(b) The first of the three MOLS, namely $M_1$, gives four sets:

    {1, 6, 11, 16}, {2, 5, 12, 15}, {3, 8, 9, 14}, {4, 7, 10, 13}.

Here the first set consists of the entries in $N$ in the positions in which 1
appears in $M_1$, the second consists of the entries in $N$ in the positions in
which 2 appears in $M_1$; and so on.

(c) In the same way, $M_2$ gives rise to the following four sets:

    {1, 8, 10, 15}, {2, 7, 9, 16}, {3, 6, 12, 13}, {4, 5, 11, 14}.

(d) Finally, $M_3$ similarly gives rise to:

    {1, 7, 12, 14}, {2, 8, 11, 13}, {3, 5, 10, 16}, {4, 6, 9, 15}.

We now have 20 sets, each of size four; we call these sets blocks. The blocks
possess an important property: no pair of elements from $\{1, \dots, 16\}$ occurs in
more than one of the blocks. Certainly, the blocks in (b), (c), (d) each contain
one element from each row and each column of $N$, and so they will contain no
pair in the same row or column of $N$; such pairs will occur precisely once in the
blocks of (a). Consider now any pair not in the same row or column of $N$. Such
a pair cannot occur in more than one block of (b), since the blocks in (b) are
disjoint; and similarly it cannot occur in two blocks of (c) or of (d). Suppose a
pair occurs in a block of (b) and a block of (c). Then there are two positions
in which $M_1$ has the same entries and $M_2$ also has the same entries. But this
contradicts orthogonality of $M_1$ and $M_2$. So we have indeed established that
no pair occurs in more than one block.

Now each block contains $\binom{4}{2} = 6$ pairs, so the 20 blocks contain altogether
$6 \times 20 = 120$ pairs. But the total number of pairs of elements of
$\{1, \dots, 16\}$ is $\binom{16}{2} = 120$; so every pair must occur in a block!

Thus the 20 blocks have the property that each pair of elements occurs in 
exactly one of the blocks. This is the balance property that is the basis of the 
study of balanced incomplete block designs which we shall look at in Chapter 
9. 

More generally, if we start with $n - 1$ MOLS $M_1, \dots, M_{n-1}$ of order $n$ on
$\{1, \dots, n\}$, and take $N$ to be the $n \times n$ array with entries
$1, 2, \dots, n^2$ in order, we construct blocks as follows:


($\alpha_1$) $n$ blocks from the rows of $N$
($\alpha_2$) $n$ blocks from the columns of $N$
($\beta_1$) $n$ blocks from $M_1$, the $j$th block consisting of the entries in $N$
in the positions in which $j$ appears in $M_1$,
...
($\beta_{n-1}$) $n$ blocks from $M_{n-1}$, similarly obtained.


This gives $(n + 1) \times n = n^2 + n$ blocks of size $n$, with the property that no
two elements lie in more than one block. But these blocks contain altogether
$n(n + 1)\binom{n}{2} = \frac{1}{2} n^2 (n + 1)(n - 1) = \binom{n^2}{2}$ pairs; so
every pair of elements of $\{1, \dots, n^2\}$ occurs in precisely one block.

We thus obtain a collection of $n(n + 1)$ subsets (blocks) of a set of size $n^2$
such that


(i) each block contains $n$ elements;
(ii) each element is in $n + 1$ blocks;
(iii) each pair of elements lies in exactly one block;
(iv) each pair of blocks intersect in at most one element.

(To check (ii), note simply that each element is in precisely one of the blocks
of each of $(\alpha_1), (\alpha_2), (\beta_1), \dots, (\beta_{n-1})$. To check (iv),
suppose that blocks $B_1, B_2$ have two elements $x, y$ in common. Then the pair
$\{x, y\}$ would occur in two blocks, contradicting (iii).)

Such a system is called an affine plane of order $n$. Compare the elements
with points, and the blocks with lines in ordinary geometry. The property
(iii) corresponds to the fact that any two points determine a unique line, and
(iv) corresponds to the fact that any two lines intersect in at most one point.
Lines which do not intersect are usually called parallel; we can think of each
of $(\alpha_1), \dots, (\beta_{n-1})$ as consisting of a set of $n$ parallel lines, each
forming a partition of $\{1, \dots, n^2\}$.
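
The construction of blocks from a complete set of MOLS can be sketched as follows; the function names are hypothetical, and the squares used are the two MOLS of order 3 with first rows 1 2 3 (rows 2 3 1, 3 1 2 and 3 1 2, 2 3 1 respectively) and the three MOLS of order 4 from Example 7.2.

```python
from itertools import combinations


def affine_plane_blocks(mols):
    """Blocks from the rows and columns of N and from each symbol of each square."""
    n = len(mols[0])
    natural = [[n * i + j + 1 for j in range(n)] for i in range(n)]  # the array N
    blocks = [set(row) for row in natural]
    blocks += [{natural[i][j] for i in range(n)} for j in range(n)]
    for square in mols:
        for symbol in range(1, n + 1):
            blocks.append({natural[i][j] for i in range(n) for j in range(n)
                           if square[i][j] == symbol})
    return blocks


def every_pair_in_exactly_one_block(blocks, size):
    """Check property (iii): each pair from {1..size} lies in exactly one block."""
    counts = {pair: 0 for pair in combinations(range(1, size + 1), 2)}
    for block in blocks:
        for pair in combinations(sorted(block), 2):
            counts[pair] += 1
    return all(count == 1 for count in counts.values())


mols3 = [[[1, 2, 3], [2, 3, 1], [3, 1, 2]],
         [[1, 2, 3], [3, 1, 2], [2, 3, 1]]]
mols4 = [[[1, 2, 3, 4], [2, 1, 4, 3], [3, 4, 1, 2], [4, 3, 2, 1]],
         [[1, 2, 3, 4], [4, 3, 2, 1], [2, 1, 4, 3], [3, 4, 1, 2]],
         [[1, 2, 3, 4], [3, 4, 1, 2], [4, 3, 2, 1], [2, 1, 4, 3]]]

plane3 = affine_plane_blocks(mols3)
plane4 = affine_plane_blocks(mols4)
print(len(plane3), every_pair_in_exactly_one_block(plane3, 9))
print(len(plane4), every_pair_in_exactly_one_block(plane4, 16))
```

Property (iv) then follows from (iii), exactly as in the argument above, so it is not checked separately.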


Example 7.10 
Take the two MOLS of order 3 given by

           1 2 3             1 2 3
    M_1 =  2 3 1 ,    M_2 =  3 1 2
           3 1 2             2 3 1

and take

           1 2 3
    N =    4 5 6
           7 8 9


Then we obtain 12 blocks of size 3:

    {1, 2, 3}, {4, 5, 6}, {7, 8, 9},     (rows of N)
    {1, 4, 7}, {2, 5, 8}, {3, 6, 9},     (columns of N)
    {1, 6, 8}, {2, 4, 9}, {3, 5, 7},     (from M_1)
    {1, 5, 9}, {2, 6, 7}, {3, 4, 8}.     (from M_2)


Each of these four sets of three blocks partitions {1,...,9}, and can be 
thought of as a set of “parallel lines”. Any two elements occur together in a 
unique block. Thinking of the blocks as lines, we represent the set-up by Figure 
7.1, where eight of the lines are drawn straight, and four are drawn as arcs. 
From orthogonal squares we have got to the verge of design theory. We shall 
take up the thread in the final chapter. 

Figure 7.1


Exercises 


Exercise 7.1 


Write down two orthogonal squares of order 3 as given by Theorem 7.2. 


Exercise 7.2 


Write down the schedules of games given (as in Example 7.1) by the 


latin square 
6 1 


3 
1 4 


1 


ao Fe fw dN 
oOo wo Fe oO OD 
me Oe 
So oOo Dm 
ao mn © DO 
oOo fF Ww am 


Exercise 7.3 


Verify that each row and each column of a magic square of order $n$ must
sum to $\frac{1}{2}n(n^2 + 1)$.

Exercise 7.4


Use de la Loubère's method to construct a magic square of order 7.


Exercise 7.5 


Use Euler's method to construct a diabolic magic square of order 7.


Exercise 7.6 


Verify that if $n$ is odd and not divisible by 3, then $A = (a_{ij})$ and
$B = (b_{ij})$, where $a_{ij} \equiv 2i + j - 2$, $b_{ij} \equiv 3i + j - 2 \pmod n$,
are orthogonal latin squares of order $n$ and that, in both, every diagonal
contains each of $1, \dots, n$.


Exercise 7.7 


A latin square A is self-orthogonal if it is orthogonal to its transpose 

A’. (This is the usual matrix transpose.) 

(a) Verify that, in Example 7.2, M2 is self-orthogonal. 

(b) Are M1, M3 self-orthogonal? 

(c) Show that, if (n,6) = 1, the latin square A of Exercise 7.6 is self- 
orthogonal. 


Exercise 7.8 


diagonal. Show that $n$ must be odd.
(b) Let $n = 2m + 1$ and define $a_{ij}$ to be the member of $\{1, \dots, n\}$
for which $a_{ij} \equiv (i + j)(m + 1) \pmod n$. Show that $A = (a_{ij})$ is a
symmetric latin square with $1, \dots, n$ on the main diagonal. Find $A$
in the case $n = 5$.



Exercise 7.9 


Show that $N(q) = q - 1$ for all prime powers $q$, using the finite field
GF($q$). Let GF($q$) $= \{\lambda_1, \lambda_2, \dots, \lambda_{q-1}, \lambda_q = 0\}$,
and define the squares $A_k$, $1 \le k \le q - 1$, by
$a^{(k)}_{ij} = \lambda_i \lambda_k + \lambda_j$. (Imitate the proof of Theorem
7.2.)

Exercise 7.10


Apply Euler’s diabolic square construction method to the square M2 of 
Example 7.2 and its transpose $M_2'$. Compare your answer with (7.3).


Exercise 7.11 

Extend

    1 2 3 4 5
    4 3 1 5 2

to a latin square of order 5.

Exercise 7.12 

Find an SDR for the sets {1,3,5}, {1,4, 5}, {2,3, 4}, {1, 2,4}. 


Exercise 7.13 


Explain why the sets {1, 2, 3, 4}, {2, 5, 6}, {1, 4, 5}, {2, 6}, {5, 8}, {1, 4, 7},
{2, 5}, {5, 6} do not possess an SDR.


Exercise 7.14


The 52 cards of an ordinary pack, consisting of four suits of 13 different 
values, are arranged in a 4 x 13 array. Prove that 13 cards of different 
values, one from each column, can be chosen. 




Exercise 7.15 


A set $S$ of $mn$ elements is partitioned into $m$ sets of size $n$ in two different
ways: $S = A_1 \cup \dots \cup A_m = B_1 \cup \dots \cup B_m$. Show that the sets
$B_i$ can be relabelled so that $A_i \cap B_i \ne \emptyset$ for each
$i = 1, \dots, m$. [Hint: consider the sets
$S_i = \{ j : A_i \cap B_j \ne \emptyset \}$.]


Exercise 7.16 


Starting with the four MOLS of order 5 given in Example 7.3, construct 
an affine plane of order 5. 


Exercise 7.17 


An $n \times n$ matrix is called a permutation matrix if all entries are 0 or
1, and there is precisely one 1 in each row and each column. Show that,
if $M$ is an $n \times n$ matrix with all entries 0 or 1, with exactly $m$ 1s in each
row and column, then M can be written as the sum of m permutation 
matrices. Illustrate in the case where M is the matrix given by (9.2) in 
Chapter 9. 


8 


Schedules and 1-Factorisations 


This chapter deals with the construction of league schedules for sports com- 
petitions or, equivalently, experimental designs involving comparisons of pairs 
of varieties. The subject matter provides nice connections with latin squares
and edge colourings of graphs, and also provides an introduction to the ideas 
of block designs and resolvability which will be studied further in the final 
chapter. So the chapter is an exposition of interrelations between apparently 
different combinatorial ideas. 


8.1 The Circle Method


Suppose that a football league contains eight teams, each of which plays every 
other team once. The games are to be arranged for seven Saturdays, with four 
games each Saturday, each team playing in one game each Saturday. How can 
a fixture list be constructed? 


Suppose also that a biology researcher wishes to compare eight types of treat- 
ment, comparing each pair of treatments once. During the first week she will 
carry out four comparisons, using all eight treatments; then during the second 
week another four comparisons, and so on. Construct a suitable schedule of 
comparisons for seven weeks. 


These two problems are clearly equivalent. Indeed, they are both equivalent to
partitioning the set of edges of K_8 into seven sets of four disjoint edges (i.e. into
seven complete matchings), four edges for each of seven weeks. If we replace
weeks by colours, we see that both problems are equivalent to finding an edge
colouring of K_8 using seven colours.


Example 8.1


The edge colouring of K_4 shown in Figure 5.4(a) shows a 3-colouring using
colours 1, 2, 3. It corresponds to the league schedule:


round 1: A v B, C v D
round 2: A v C, B v D
round 3: A v D, B v C.


For 2n teams, we have to find an edge colouring of K_{2n} using 2n − 1 colours.
We have already seen how to do this in Theorem 5.11. We describe the method
again, in slightly different language.


The circle method 


Label the teams by ∞, 1, 2, ..., 2n − 1, and place 1, 2, ..., 2n − 1 equally spaced
round a circle with ∞ at the centre. The case 2n = 8 is shown in Figure 8.1.
In Figure 8.1(a) we have the fixtures for the first day, and Figure 8.1(b) gives
the fixtures for the second day. By rotating the chords we obtain the games
for each of the 2n − 1 days.


Figure 8.1


In general, for 2n teams ∞, 1, ..., 2n − 1, on day i we play the games

    i v ∞, (i − 1) v (i + 1), (i − 2) v (i + 2), ..., (i − (n − 1)) v (i + (n − 1)),

where each integer is reduced (mod 2n − 1) to lie in the set {1, ..., 2n − 1}.


Example 8.2


Fixture list for eight teams constructed by the circle method.

    Day 1    ∞ v 1    7 v 2    6 v 3    5 v 4
    Day 2    ∞ v 2    1 v 3    7 v 4    6 v 5
    Day 3    ∞ v 3    2 v 4    1 v 5    7 v 6
    Day 4    ∞ v 4    3 v 5    2 v 6    1 v 7
    Day 5    ∞ v 5    4 v 6    3 v 7    2 v 1
    Day 6    ∞ v 6    5 v 7    4 v 1    3 v 2
    Day 7    ∞ v 7    6 v 1    5 v 2    4 v 3

Note that, on day i, teams u and v play each other where u + v ≡ 2i (mod 2n − 1).
Note too that the schedule is cyclic in the sense that each round is obtained
by adding 1 (mod 2n − 1) to each entry in the previous round (∞ earning itself
its name: ∞ + 1 = ∞).
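
The circle method is easy to mechanise. The following is a minimal Python sketch, not part of the original text; the function name `circle_method` and the use of the string "inf" for the centre team ∞ are illustrative assumptions.

```python
def circle_method(num_teams):
    """Return the rounds of a league schedule built by the circle method.

    Teams are 1, ..., num_teams - 1 together with the string "inf",
    which stands for the team placed at the centre of the circle.
    Each round is a list of (team, team) pairs.
    """
    if num_teams < 2 or num_teams % 2 != 0:
        raise ValueError("the circle method needs an even number of teams")
    m = num_teams - 1      # the modulus 2n - 1
    n = num_teams // 2

    def reduce(x):
        # Reduce mod 2n - 1 so that the result lies in 1, ..., 2n - 1.
        return (x - 1) % m + 1

    rounds = []
    for day in range(1, m + 1):
        games = [("inf", day)]
        for j in range(1, n):
            games.append((reduce(day - j), reduce(day + j)))
        rounds.append(games)
    return rounds


schedule = circle_method(8)
for day, games in enumerate(schedule, start=1):
    print("Day", day, "  ".join(f"{a} v {b}" for a, b in games))
```

For eight teams this reproduces the fixture list of Example 8.2, with "inf" in place of ∞.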

The circle method yields a partition of the edge-set of K_{2n} into 2n − 1 edge-disjoint
perfect matchings (1-factors). The name 1-factor reminds us that each vertex
in a perfect matching has degree 1 in that matching. For example, the edge
colouring of K_4 in Figure 5.4(a) gives a partition of the edge-set of K_4 into
three 1-factors.


Definition 8.1 


A 1-factorisation of a graph G with 2n vertices is a partition of the edge-set 
into 1-factors. 


Thus the circle method establishes the following result. 


Theorem 8.1 
K_{2n} has a 1-factorisation.


Not all graphs with an even number of vertices have 1-factorisations. Indeed,
it should be clear that a graph can have a 1-factorisation only if it is regular,
i.e. if every vertex has the same degree. If each vertex has degree r, then the
1-factorisation, if it exists, will consist of r 1-factors. Since a 1-factorisation clearly
gives an edge colouring of the graph with the minimum possible number of
colours, and, conversely, an edge colouring of a graph, regular of vertex degree
Δ, using Δ colours, clearly gives a 1-factorisation, we have:


Theorem 8.2


A regular graph G with 2n vertices has a 1-factorisation if and only if it is class
1.


Thus, by Example 5.13, the Petersen graph does not have a 1-factorisation. 


Example 8.3 


How many 1-factorisations does K_6 have? Let K_6 have vertices a, ..., f, and
suppose that one of the 1-factors in a 1-factorisation is ab, cd, ef. In some other
1-factor we must have edge ac. There are only two possibilities:


ac, be, df or ac, bf, de.
But the 1-factorisation including

    ab  cd  ef
    ac  be  df
    ad
    ae
    af

can be completed in only one way (check this!) as can

    ab  cd  ef
    ac  bf  de
    ad
    ae
    af


So there are only two 1-factorisations containing the 1-factor ab, cd, ef. Similarly
there are only two 1-factorisations containing any other 1-factor. But the
total number of 1-factors is just the number of ways of partitioning a set of six
elements into three pairs, and so, by Corollary 5.2, is 6!/(2^3 · 3!) = 15. For each of the
15 choices of 1-factor, there are two ways of extending it to a 1-factorisation,
giving 30 1-factorisations altogether. But each distinct 1-factorisation arises
five times in this way, depending on which of the five 1-factors in it is taken
first. So the number of distinct 1-factorisations is 30/5 = 6.
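
The count can be confirmed by brute force. The sketch below is an illustration only (the helper names `perfect_matchings` and `count_one_factorisations` are assumptions made here): it lists all 1-factors of K_6 and then counts the sets of five 1-factors that together use all 15 edges.

```python
from itertools import combinations


def perfect_matchings(vertices):
    """Yield every 1-factor of the complete graph on the given vertex list."""
    if not vertices:
        yield frozenset()
        return
    first, rest = vertices[0], vertices[1:]
    for partner in rest:
        remaining = [v for v in rest if v != partner]
        for matching in perfect_matchings(remaining):
            yield matching | {frozenset((first, partner))}


def count_one_factorisations(vertices):
    """Return (number of 1-factors, number of 1-factorisations)."""
    vertices = list(vertices)
    factors = list(perfect_matchings(vertices))
    needed = len(vertices) - 1
    total_edges = len(vertices) * (len(vertices) - 1) // 2
    count = 0
    for chosen in combinations(factors, needed):
        # The chosen 1-factors are pairwise edge-disjoint exactly when
        # their union contains every edge of the complete graph.
        if len(frozenset().union(*chosen)) == total_edges:
            count += 1
    return len(factors), count


num_factors, num_factorisations = count_one_factorisations("abcdef")
print(num_factors, num_factorisations)
```

This exhaustive search is only practical for very small graphs; the number of candidate sets grows far too quickly beyond K_6.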

Thus K_6 has six different 1-factorisations. The number of 1-factorisations of
K_{2n} rises dramatically with n: for K_8 there are 6240, and for K_10 there are
1 225 566 720.


Schedules for 2n + 1 teams


If a league schedule is to be arranged for 2n + 1 teams, then there can be at most
n games on any one day, with one team resting. Such a schedule corresponds
to an edge colouring of K_{2n+1} using 2n + 1 colours, as in Theorem 5.11. The
easiest way to obtain such a schedule is to use the circle method to construct a
schedule for 2n + 2 teams and then omit all the games involving ∞.


Example 8.4 


Suppose we want a schedule for five teams. Take the schedule for six teams 
given by the circle method: 


    ∞ v 1    5 v 2    3 v 4
    ∞ v 2    1 v 3    4 v 5
    ∞ v 3    2 v 4    5 v 1
    ∞ v 4    3 v 5    1 v 2
    ∞ v 5    4 v 1    2 v 3.


Omit all the games involving ∞, to obtain


    5 v 2    3 v 4
    1 v 3    4 v 5
    2 v 4    5 v 1
    3 v 5    1 v 2
    4 v 1    2 v 3.


This is in fact the schedule used in recent years for the Five Nations Rugby
championship involving England, France, Ireland, Scotland and Wales. For
example, in the final year of that championship (1999), the key was:


1 = England, 2 = France, 3 = Scotland, 4 = Wales, 5 = Ireland. 


In the year 2000 the championship became the Six Nations championship with
the inclusion of Italy. The schedule used was that obtained above for six teams,
with the first round ∞ v 1, 2 v 5, 4 v 3 and with the games involving ∞ being
given suitable home and away orientations. This gave


    ∞ v 1    2 v 5    4 v 3
    2 v ∞    3 v 1    5 v 4
    3 v ∞    4 v 2    1 v 5
    ∞ v 4    5 v 3    2 v 1
    5 v ∞    1 v 4    3 v 2

with the key ∞ = Italy, 1 = Scotland, 2 = Wales, 3 = Ireland, 4 = England, 5 =
France. In the Five Nations championship the schedule automatically arranged
for each team to alternate home and away games. This is not possible for an
even number of teams (see Exercise 8.10); in the above, each team has one
“break” in the alternating of venues.
Summing up, we have the following theorem. 


Theorem 8.3 

(i) For all n ≥ 1, there exists an arrangement of the \binom{2n}{2} = n(2n − 1) 2-element
subsets of {1, ..., 2n} into 2n − 1 classes, each class consisting of n disjoint
pairs.


(ii) For all n ≥ 1, there exists an arrangement of the \binom{2n+1}{2} = n(2n + 1)
2-element subsets of {1, ..., 2n + 1} into 2n + 1 classes, each consisting of
n disjoint pairs, with each element being absent from the pairs in exactly
one class.


There is a natural generalisation of (i). Can the \binom{3n}{3} = (1/2)n(3n − 1)(3n − 2)
3-element subsets (triples) of {1, ..., 3n} be arranged into (1/2)(3n − 1)(3n − 2)
classes each consisting of n disjoint triples? The case n = 2 is trivial, since
every triple can be paired with its complement. The case n = 3 was solved by
Sylvester. Here is an elegant solution.


Example 8.5 


To put the 84 3-element subsets of {1,... ,9} into 28 groups of three triples 
such that the triples in each group form a partition of {1,... ,9}. 


Solution 


Consider the following seven square arrays.


    123    372    234    318    531    912    261
    456    156    756    426    486    453    459
    789    489    189    759    729    786    783


For each array we get four groups of 3 triples, from the rows, the columns,
the forward leading “diagonals” and the backward leading diagonals. The first
array gives

    123, 456, 789 (rows)
    147, 258, 369 (columns)
    159, 267, 348 (forward diagonals)
    168, 249, 357 (backward diagonals).


Similarly we obtain four partitions of {1,...,9} from each of the other six 
arrays. Altogether we obtain 28 groups of three triples, containing between 
them all the 84 3-element subsets. 
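
The claim that the 28 groups cover all 84 triples can be checked mechanically. The sketch below is illustrative only; it assumes the seven arrays are as printed above, with the top row of the third array read as 234, and the helper name `parallel_classes` is introduced here.

```python
from itertools import combinations

# The seven 3 x 3 arrays of Example 8.5, each given by its rows.
ARRAYS = [
    ["123", "456", "789"],
    ["372", "156", "489"],
    ["234", "756", "189"],
    ["318", "426", "759"],
    ["531", "486", "729"],
    ["912", "453", "786"],
    ["261", "459", "783"],
]


def parallel_classes(array):
    """Return the four groups of three triples obtained from one array."""
    idx = range(3)
    rows = [frozenset(array[r][c] for c in idx) for r in idx]
    cols = [frozenset(array[r][c] for r in idx) for c in idx]
    forward = [frozenset(array[r][(c + r) % 3] for r in idx) for c in idx]
    backward = [frozenset(array[r][(c - r) % 3] for r in idx) for c in idx]
    return [rows, cols, forward, backward]


groups = [group for array in ARRAYS for group in parallel_classes(array)]
triples = {triple for group in groups for triple in group}
all_triples = {frozenset(t) for t in combinations("123456789", 3)}

print(len(groups), len(triples), triples == all_triples)
```

Each group is also a partition of {1, ..., 9}, since its three triples are disjoint and together contain nine symbols.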


There is a remarkable theorem due to Baranyai (1973) which we state without 
proof. A proof can be found in [18]. 


Theorem 8.4 
The set of all \binom{nk}{k} k-element subsets of {1, ..., nk} can be partitioned into
(1/n)\binom{nk}{k} = \binom{nk−1}{k−1} classes, each class consisting of n disjoint k-element subsets.


8.2 Bipartite Tournaments and 1-Factorisations of K_{n,n}


Example 8.6 


Two schools, Alpha Academy and Beta High School, arrange a tennis match 
in which each school is represented by 4 players. Each player is to play each 
player in the opposing team once, and the games are to be arranged in four
rounds, everyone playing in each round. Arrange a schedule. 


Solution 


Let the players of Alpha Academy be A_1, ..., A_4, and those of Beta High School
be B_1, ..., B_4.


    Round 1    A_1 v B_1, A_2 v B_2, A_3 v B_3, A_4 v B_4
    Round 2    A_1 v B_2, A_2 v B_3, A_3 v B_4, A_4 v B_1
    Round 3    A_1 v B_3, A_2 v B_4, A_3 v B_1, A_4 v B_2
    Round 4    A_1 v B_4, A_2 v B_1, A_3 v B_2, A_4 v B_3.


Note that this schedule can be represented by a latin square M: in the ith
column of M list the subscripts of the opponents of A_i in order:


        1 2 3 4
    M = 2 3 4 1
        3 4 1 2
        4 1 2 3


Conversely, any latin square on 1, ..., 4 can be interpreted as a schedule for
a bipartite tournament by reversing this process. Thus bipartite tournaments
(i.e. tournaments between two teams in which each player of one team plays
every player of the other team) are equivalent to latin squares.

Note also that the solution to Example 8.6 can be re-expressed in terms of a
1-factorisation of the graph K_{4,4}. In Figure 8.2, the edges of K_{4,4} are coloured
using four colours, corresponding to the rounds in which the games (represented
by edges) are played. Thus a 1-factorisation of K_{n,n} is equivalent to a bipartite
tournament for two teams of size n.


Figure 8.2 
Since there is a latin square of order n for all n ≥ 1, we have the following
result.


Theorem 8.5


For all n ≥ 1, K_{n,n} has a 1-factorisation.


There is another way of representing a bipartite tournament by a latin square.
Define the latin square N = (n_{ij}) by

    n_{ij} = k if A_i plays B_j in round k.


For the schedule in Example 8.6 we obtain


        1 2 3 4
    N = 4 1 2 3
        3 4 1 2
        2 3 4 1


The square N is conjugate to the square M: n_{ij} = k ⇔ m_{ki} = j. Other
conjugates of M are obtained similarly by permuting i, j, k. Note that the
transpose M^T of M is a conjugate of M: m_{ij} = k ⇔ m^T_{ji} = k.
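
The passage between the two squares can be sketched in a few lines of Python. This is an illustration under the conventions stated above (column i of M lists the opponents of A_i round by round); the function names `rounds_from_M` and `conjugate_N` are assumptions made here.

```python
# The latin square M of Example 8.6, given by its rows.
M = [
    [1, 2, 3, 4],
    [2, 3, 4, 1],
    [3, 4, 1, 2],
    [4, 1, 2, 3],
]


def rounds_from_M(M):
    """Round k consists of the games A_i v B_j with j = m_{ki}."""
    n = len(M)
    return [[(i + 1, M[k][i]) for i in range(n)] for k in range(n)]


def conjugate_N(M):
    """Build N with n_{ij} = k exactly when m_{ki} = j."""
    n = len(M)
    N = [[0] * n for _ in range(n)]
    for k in range(n):
        for i in range(n):
            j = M[k][i]
            N[i][j - 1] = k + 1
    return N


for k, games in enumerate(rounds_from_M(M), start=1):
    print("Round", k, ", ".join(f"A{i} v B{j}" for i, j in games))
N = conjugate_N(M)
for row in N:
    print(*row)
```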


Another 1-factorisation of K_{2n}


The existence of a 1-factorisation of K_{n,n} enables us to give another general
method of constructing a 1-factorisation of K_{2n}, different from the one arising
from the circle method. We describe it in terms of a league schedule.

Given 2n teams, label them x_1, ..., x_n, y_1, ..., y_n. We can then construct a
bipartite tournament involving n rounds, in the course of which every x_i plays
every y_j once. Without loss of generality, we can assume that, for each i, x_i
plays y_i in the first round.

We now consider two cases. 


(i) If n is even, we can then play a league of n − 1 rounds involving the x_i in
parallel with a league involving the y_i. This will give n − 1 further rounds,
thus completing the required tournament. Here is an example with n = 4.


Example 8.7


    x_1 v y_1    x_2 v y_2    x_3 v y_3    x_4 v y_4
    x_1 v y_2    x_2 v y_3    x_3 v y_4    x_4 v y_1
    x_1 v y_3    x_2 v y_4    x_3 v y_1    x_4 v y_2
    x_1 v y_4    x_2 v y_1    x_3 v y_2    x_4 v y_3
    x_1 v x_2    x_3 v x_4    y_1 v y_2    y_3 v y_4
    x_1 v x_3    x_2 v x_4    y_1 v y_3    y_2 v y_4
    x_1 v x_4    x_2 v x_3    y_1 v y_4    y_2 v y_3

(ii) If n is odd, this idea does not work; the bipartite tournament cannot
be extended to a full schedule since the games involving the odd number
of x_i require n further rounds. But we can adapt the idea. Introduce two
new teams ∞_x, ∞_y, and use the circle construction to obtain two schedules
S_x, S_y on {∞_x, x_1, ..., x_n} and {∞_y, y_1, ..., y_n} respectively, in which ∞_x
plays x_i and ∞_y plays y_i in round i. For the ith round of the required
schedule, take the ith round games of S_x and S_y, but replace the games
∞_x v x_i and ∞_y v y_i by the one game x_i v y_i. The resulting n rounds, along
with all but the first round of the bipartite tournament, give the required
schedule.


Example 8.8 


We construct a schedule for n = 3. Here S_x is

    ∞_x v x_1, x_2 v x_3
    ∞_x v x_2, x_1 v x_3
    ∞_x v x_3, x_1 v x_2

and a bipartite tournament is

    x_1 v y_1, x_2 v y_2, x_3 v y_3
    x_1 v y_2, x_2 v y_3, x_3 v y_1
    x_1 v y_3, x_2 v y_1, x_3 v y_2.

The final schedule is

    x_1 v y_1, x_2 v x_3, y_2 v y_3
    x_2 v y_2, x_1 v x_3, y_1 v y_3
    x_3 v y_3, x_1 v x_2, y_1 v y_2
    x_1 v y_2, x_2 v y_3, x_3 v y_1
    x_1 v y_3, x_2 v y_1, x_3 v y_2.
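
Both cases of this construction can be sketched together. The code below is an illustrative sketch only: the function names, the team labels "x1", "y1", "inf_x", "inf_y", and the choice of a cyclic bipartite tournament (x_i plays y_{i+k} in round k + 1) are assumptions made for the example.

```python
def circle_rounds(teams):
    """Circle method on an even-length list; teams[0] acts as the centre.

    Round i (counting from 0) contains the game teams[0] v teams[i + 1].
    """
    m = len(teams) - 1
    rounds = []
    for i in range(m):
        games = [(teams[0], teams[1 + i])]
        for j in range(1, len(teams) // 2):
            games.append((teams[1 + (i - j) % m], teams[1 + (i + j) % m]))
        rounds.append(games)
    return rounds


def bipartite_rounds(xs, ys):
    """Cyclic bipartite tournament; the first round is x_i v y_i."""
    n = len(xs)
    return [[(xs[i], ys[(i + k) % n]) for i in range(n)] for k in range(n)]


def schedule_2n(n):
    """League schedule for x_1..x_n, y_1..y_n built from K_{n,n}."""
    xs = [f"x{i}" for i in range(1, n + 1)]
    ys = [f"y{i}" for i in range(1, n + 1)]
    bip = bipartite_rounds(xs, ys)
    if n % 2 == 0:
        # Case (i): two leagues of n - 1 rounds played in parallel.
        extra = [gx + gy for gx, gy in zip(circle_rounds(xs), circle_rounds(ys))]
        return bip + extra
    # Case (ii): replace inf_x v x_i and inf_y v y_i by the game x_i v y_i.
    sx = circle_rounds(["inf_x"] + xs)
    sy = circle_rounds(["inf_y"] + ys)
    merged = [[(xs[i], ys[i])] + sx[i][1:] + sy[i][1:] for i in range(n)]
    return merged + bip[1:]


for games in schedule_2n(3):
    print(", ".join(f"{a} v {b}" for a, b in games))
```

For n = 3 the output agrees with the final schedule of Example 8.8, up to the order in which the two teams of a game are written.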


8.3 Tournaments from Orthogonal Latin Squares


Court balance 


Suppose now that, in the tennis match of Example 8.6, four courts are available,
of differing quality, and it is requested that the games should be arranged so
that, not only does every A_i play every B_j once, but each player plays once on
each court.


One way to solve this problem is to take the join of two MOLS of order 4,
say

    11 22 33 44
    24 13 42 31
    32 41 14 23
    43 34 21 12

Take the rows as corresponding to the rounds, and the columns as correspond- 
ing to the courts. The latin property ensures that each player plays once in 
each round and once on each court; orthogonality ensures that every A; plays 
every B; once. So we obtain the solution given in Table 8.1. 


Table 8.1

               Court 1      Court 2      Court 3      Court 4
    Round 1    A_1 v B_1    A_2 v B_2    A_3 v B_3    A_4 v B_4
    Round 2    A_2 v B_4    A_1 v B_3    A_4 v B_2    A_3 v B_1
    Round 3    A_3 v B_2    A_4 v B_1    A_1 v B_4    A_2 v B_3
    Round 4    A_4 v B_3    A_3 v B_4    A_2 v B_1    A_1 v B_2
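
The two properties used here (latin rows and columns, and orthogonality) can be checked directly from the join. This is a minimal sketch for illustration; the names `court_schedule` and `is_court_balanced` are introduced here and are not from the text.

```python
# The join of the two MOLS of order 4: entry "ij" means A_i plays B_j.
JOIN = [
    ["11", "22", "33", "44"],
    ["24", "13", "42", "31"],
    ["32", "41", "14", "23"],
    ["43", "34", "21", "12"],
]


def court_schedule(join):
    """table[round][court] = (i, j), meaning A_i plays B_j."""
    return [[(int(cell[0]), int(cell[1])) for cell in row] for row in join]


def is_court_balanced(table):
    """Check each player appears once per round and once per court,
    and that every A_i meets every B_j exactly once."""
    n = len(table)
    everyone = set(range(1, n + 1))
    lines = [list(row) for row in table] + [list(col) for col in zip(*table)]
    for line in lines:
        if {i for i, _ in line} != everyone or {j for _, j in line} != everyone:
            return False
    games = {game for row in table for game in row}
    return len(games) == n * n


table = court_schedule(JOIN)
for r, row in enumerate(table, start=1):
    print("Round", r, "  ".join(f"A{i} v B{j}" for i, j in row))
print(is_court_balanced(table))
```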


Mixed doubles 


Another use of orthogonal latin squares is in the construction of mixed doubles 
tournaments. Suppose that Alpha Academy and Beta High School now decide 
to play mixed doubles: each school provides four boys and four girls, and each 
player is to play in four games, partnering each person of the opposite sex 
once, and opposing each player of the other team once. The 16 games have to 
be arranged in four rounds, each of four games, each player being involved in 
one game in each round. 

Let us denote Alpha’s boys by B_1, ..., B_4 and Beta’s boys by b_1, ..., b_4;
similarly we denote Alpha’s girls by G_1, ..., G_4 and Beta’s by g_1, ..., g_4. Take
also the MOLS M_1, M_2, M_3 of Example 7.2.

We interpret M_1 as giving the partners of the Alpha boys: if the (i, j)th
entry is k, then B_i partners G_k when playing against b_j. Similarly M_2 gives
the partners of the Beta boys: if the (i, j)th entry of M_2 is ℓ then b_j partners
g_ℓ when playing against B_i. Since M_1, M_2 are latin squares, no repetition of
partners occurs. Since the squares are orthogonal, no girl opposes another girl
more than once.

Having obtained the games of the required schedule, it now remains to arrange
them into four rounds. This is achieved by using M_3: if the (i, j)th entry of M_3
is k, place the game involving B_i and b_j in round k. Suppose this resulted in girl
G_i playing two games in round k. Then we would have two games B_r G_i v b_s g_t
and B_u G_i v b_x g_y in round k. But then the (r, s)th and (u, x)th entries of M_3
are both k and the (r, s)th and (u, x)th entries of M_1 are both i, contradicting
orthogonality of M_1 and M_3. Similarly the orthogonality of M_2 and M_3 prevents
any girl g_j playing twice in one round.

Thus, for example, since the (3, 2) entries of M_1 and M_2 are 4 and 1, one of
the games will be B_3 G_4 v b_2 g_1. Since M_3 has 3 in the (3, 2) position, we assign
this game to round 3. In this way we obtain the following schedule:


    Round 1    B_1G_1 v b_1g_1    B_2G_4 v b_3g_2    B_3G_2 v b_4g_3    B_4G_3 v b_2g_4
    Round 2    B_1G_2 v b_2g_2    B_2G_3 v b_4g_1    B_3G_1 v b_3g_4    B_4G_4 v b_1g_3
    Round 3    B_1G_3 v b_3g_3    B_2G_2 v b_1g_4    B_3G_4 v b_2g_1    B_4G_1 v b_4g_2
    Round 4    B_1G_4 v b_4g_4    B_2G_1 v b_2g_3    B_3G_3 v b_1g_2    B_4G_2 v b_3g_1
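
The construction can be sketched in Python as follows. The three squares below are an assumption: they were read off from the schedule above and are taken to agree with M_1, M_2, M_3 of Example 7.2, which is not reproduced in this chapter. The function name `mixed_doubles` is likewise illustrative.

```python
# Assumed forms of the MOLS M_1, M_2, M_3 (reconstructed from the schedule).
M1 = [[1, 2, 3, 4], [2, 1, 4, 3], [3, 4, 1, 2], [4, 3, 2, 1]]
M2 = [[1, 2, 3, 4], [4, 3, 2, 1], [2, 1, 4, 3], [3, 4, 1, 2]]
M3 = [[1, 2, 3, 4], [3, 4, 1, 2], [4, 3, 2, 1], [2, 1, 4, 3]]


def mixed_doubles(m1, m2, m3):
    """Return {round: [(B_i, G_k, b_j, g_l), ...]} built from three MOLS.

    m1[i][j] = k: B_i partners G_k against b_j.
    m2[i][j] = l: b_j partners g_l against B_i.
    m3[i][j]    : the round in which B_i meets b_j.
    """
    n = len(m1)
    rounds = {k: [] for k in range(1, n + 1)}
    for i in range(n):
        for j in range(n):
            game = (f"B{i + 1}", f"G{m1[i][j]}", f"b{j + 1}", f"g{m2[i][j]}")
            rounds[m3[i][j]].append(game)
    return rounds


rounds = mixed_doubles(M1, M2, M3)
for k in sorted(rounds):
    print("Round", k, "  ".join(f"{B}{G} v {b}{g}" for B, G, b, g in rounds[k]))
```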


Exercises 


Exercise 8.1 


Use the circle method to construct a league schedule for 10 teams. Deduce 
a league schedule for 9 teams. 


Exercise 8.2 


Use the method of Example 8.8 to construct a league schedule for 10 
teams. 


Exercise 8.3 


Suppose that the first r rounds of a bipartite tournament between two 
teams of size n have been constructed (r < n). Can these r rounds always 
be extended to a full bipartite tournament? 


Exercise 8.4 


Find a 1-factor, if one exists, for (a) the Petersen graph, (b) the ethane 
graph (Figure 3.4), (c) the graph of an octahedron, (d) the graph of a 
cube. 


Exercise 8.5 


Partition the edges of the graph of an octahedron into two disjoint hamil-
tonian cycles. Deduce that the graph has a 1-factorisation. 


Exercise 8.6


Which of the platonic graphs have a 1-factorisation? 


Exercise 8.7 


Prove that any hamiltonian graph in which each vertex has degree 3 has 
a 1-factorisation. 


Exercise 8.8 


(a) Imitate Example 8.6 for teams of 5 players. 
(b) Obtain a schedule in which each player plays once on each of 5 courts. 


Exercise 8.9 


(a) Construct a mixed doubles tournament between Gamma Grammar 
School and Delta District Comprehensive, with 5 boys and 5 girls in 
each team. 

(b) Suppose there are 5 courts available. Construct a schedule in which 
each player plays exactly once on each court. 


Exercise 8.10 


In a league schedule for 2n teams it is desired that each team should
alternate home and away fixtures as much as possible. A repetition of
home (or away) fixtures in two consecutive games is called a break.
For example, in Example 8.2, team 6 plays HHAAAAH and so has 4
breaks.

(a) Show that any schedule for 2n teams can have at most two teams
with no breaks, and deduce that there must be at least 2n − 2 breaks
altogether.

(b) Show that a schedule with exactly 2n − 2 breaks can be constructed.
(In Figure 8.1, take the home teams as those alternately at left, right
ends of chords, and alternate ∞ home and away throughout.)


Exercise 8.11 


Suppose a league schedule has been constructed for 2n teams, but venues 
have not yet been assigned. Show that it is possible to assign venues so 
that, in each round, precisely one of the two teams of each first round 
game is at home. 


9


Introduction to Designs


We introduce the idea of a balanced incomplete block design, and look at some 
special families of such designs, namely finite projective planes, affine planes, 
Steiner triple systems and Hadamard designs. The connection between finite 
projective planes and complete sets of MOLS is established. We also describe 
the usefulness of difference systems in the construction of designs. Finally we 
give a brief introduction to some of the ideas behind error-correcting codes. 


9.1 Balanced Incomplete Block Designs 


Example 9.1 


Seven golfers are to spend a week’s holiday at a seaside town which boasts two 
splendid golf courses. They decide that each should play a round of golf on 
each of the seven days. They also decide that on each day they should split 
into two groups, one of size 3 to play on one course, and the other of size 4 
to play on the other course. Can the groups be arranged so that each pair of 
golfers plays together in a group of 3 the same number of times, and each pair 
plays together in a group of 4 the same number of times? 


Solution 


Here is one solution: the groups for each day are shown. It can be easily checked
that each pair plays together once in a group of 3 and twice in a group of 4.


    Day 1    {1,2,4}    {3,5,6,7}
    Day 2    {2,3,5}    {4,6,7,1}
    Day 3    {3,4,6}    {5,7,1,2}
    Day 4    {4,5,7}    {6,1,2,3}
    Day 5    {5,6,1}    {7,2,3,4}
    Day 6    {6,7,2}    {1,3,4,5}
    Day 7    {7,1,3}    {2,4,5,6}


What we have done is to make use of the configuration known as the seven-
point plane, shown in Figure 9.1. In it, there are seven points and seven
lines, with each line containing three points, and each pair of points being in
exactly one line. The groups of size 4 are the complements of the lines of this
configuration.


Figure 9.1 The seven-point plane


Example 9.2


The following subsets of {1,... ,6} have the property that each subset has 3 
elements and each pair of elements occurs in two of the subsets: 


{1, 2, 3}, {1, 2, 4}, {1, 3, 5}, {1, 4, 6}, {1, 5, 6},
{2, 3, 6}, {2, 4, 5}, {2, 5, 6}, {3, 4, 5}, {3, 4, 6}.


This example was given by the statistician F. Yates in 1936 in a paper which 
discussed the use of balanced designs in the construction of agricultural exper- 
iments. Many examples of such designs had been discussed by mathematicians 
over the previous hundred years, but his paper crystallised the idea and led
to much work on the subject by both statisticians and mathematicians. 


Definition 9.1 


A (v, k, λ) design is a collection of k-element subsets (called blocks) of a v-
element set S, where k < v, such that each pair of elements of S occur together
in exactly λ blocks. Such a design is also known as a balanced incomplete
block design (BIBD).

The adjective “balanced” refers to the existence of λ, and “incomplete” refers
to the requirement that k < v (so that no block contains all the elements). We
have already seen some examples of balanced designs.
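
The defining property is easy to test by counting pairs. The following sketch is illustrative; the function name `design_lambda` is an assumption, and the two block lists are those of the seven-point plane (Example 9.1) and of Example 9.2.

```python
from itertools import combinations


def design_lambda(points, blocks):
    """Return λ if every pair of points lies in the same number of blocks.

    Returns None when the pair counts differ, so that the blocks do not
    form a balanced design. Block sizes are not checked here.
    """
    counts = {pair: 0 for pair in combinations(sorted(points), 2)}
    for block in blocks:
        for pair in combinations(sorted(block), 2):
            counts[pair] += 1
    values = set(counts.values())
    return values.pop() if len(values) == 1 else None


# Lines of the seven-point plane: {1,2,4} developed modulo 7.
seven_point_plane = [
    {(i + d - 1) % 7 + 1 for d in (0, 1, 3)} for i in range(1, 8)
]
# The design of Example 9.2.
yates = [
    {1, 2, 3}, {1, 2, 4}, {1, 3, 5}, {1, 4, 6}, {1, 5, 6},
    {2, 3, 6}, {2, 4, 5}, {2, 5, 6}, {3, 4, 5}, {3, 4, 6},
]

print(design_lambda(range(1, 8), seven_point_plane))
print(design_lambda(range(1, 7), yates))
```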


Example 9.3 


(i) League schedules. The games of a league schedule for 2n teams form a
(2n, 2, 1) design; if each team plays each other twice in a season, the games
form a (2n, 2, 2) design.

(ii) The seven-point plane (Figure 9.1) is a (7, 3, 1) design.

(iii) The design of Example 9.2 is a (6, 3, 2) design.

(iv) An affine plane of order n is an (n^2, n, 1) design.


In particular, an affine plane of order 3 is a (9,3, 1) design. It consists of 12 
blocks of size 3, such that every pair of elements occur together once in a block. 
Block designs with k = 3 and λ = 1 were among the first designs to be studied.


Definition 9.2 


A (v,3,1) design is called a Steiner triple system of order v and is often 
denoted by STS(v). 


Steiner triple systems can exist only for certain values of v. To prove this, we 
first obtain the following general result. 


Theorem 9.1 


Suppose that a (v, k, λ) design has b blocks. Then each element occurs in
precisely r blocks, where

    λ(v − 1) = r(k − 1) and bk = vr. (9.1)


Proof


Choose any element x, and suppose it occurs in r blocks. In each of these
r blocks it makes a pair with k − 1 other elements; so altogether there are
r(k − 1) pairs in the blocks involving x. But x is paired with each of the v − 1
other elements λ times, so the number of pairs involving x is also λ(v − 1). So
λ(v − 1) = r(k − 1). This shows that r is independent of the choice of x, since
it is uniquely determined by v, k and λ.

To prove that bk = vr, note first that each block has k elements, and so the b
blocks contain bk elements altogether (including repetitions). But each element
x occurs r times in the blocks, so we must have bk = vr.


Example 9.4


In an affine plane of order n, i.e. an (n^2, n, 1) design, we have 1·(n^2 − 1) = (n − 1)r,
so r = n + 1. Also bn = n^2 r, so b = n(n + 1) = n^2 + n, as in Section 7.4.


Example 9.5 


No (11, 6, 2) design can exist, since it would require 2(11 − 1) = 5r, i.e. r = 4,
and 6b = 44, which is clearly impossible.
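
The two conditions in (9.1) give a quick necessary test for the existence of a design. The sketch below is an illustration (the function name `replication_and_blocks` is an assumption); passing the test does not by itself guarantee that a design exists.

```python
def replication_and_blocks(v, k, lam):
    """Solve λ(v − 1) = r(k − 1) and bk = vr for integers r and b.

    Returns (r, b), or None if (9.1) has no integer solution, in which
    case no (v, k, λ) design can exist.
    """
    if (lam * (v - 1)) % (k - 1) != 0:
        return None
    r = lam * (v - 1) // (k - 1)
    if (v * r) % k != 0:
        return None
    return r, (v * r) // k


print(replication_and_blocks(7, 3, 1))    # seven-point plane
print(replication_and_blocks(9, 3, 1))    # affine plane of order 3
print(replication_and_blocks(11, 6, 2))   # Example 9.5
```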


Theorem 9.2 
A STS(v) can exist only if v ≡ 1 or 3 (mod 6).


Proof 


Suppose a (v, 3, 1) design exists. Then v − 1 = 2r and 3b = vr, so that v = 2r + 1
(which is odd) and b = (1/6)v(v − 1). If v = 6u + 5 then b = (1/6)(6u + 5)(6u + 4) is
not an integer; so we must have v ≡ 1 or 3 (mod 6).


Note that v = 7 and 9, already dealt with, are of this form. Steiner systems are
so named because Steiner discussed them in an 1853 paper, having come across
them in a geometrical setting. But it had already been shown by Kirkman in
1847 that not only was the condition v ≡ 1 or 3 (mod 6) necessary, but it was
also sufficient. So STS(v) exists if and only if v ≡ 1 or 3 (mod 6).


Example 9.6 


The sets {1, 2, 5}, {2, 3, 6}, ..., {9, 10, 13}, {10, 11, 1}, ..., {13, 1, 4} and the
sets {1, 3, 9}, {2, 4, 10}, ..., {5, 7, 13}, {6, 8, 1}, ..., {13, 2, 8} form a STS(13).
Note that the blocks are obtained from {1, 2, 5} and {1, 3, 9} by adding 1
successively to each element, and working modulo 13. This is similar to the STS(7)
of Figure 9.1 which is obtained from the initial block {1, 2, 4} and working
modulo 7. Why this works will be explained in Section 9.5.
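
The development of the two initial blocks can be carried out and checked as follows. This is a sketch for illustration; the function name `develop` is an assumption.

```python
from itertools import combinations


def develop(base_blocks, modulus):
    """Add 0, 1, ..., modulus − 1 to each base block, working modulo
    `modulus` with residues written as 1, ..., modulus."""
    blocks = []
    for base in base_blocks:
        for shift in range(modulus):
            blocks.append(
                frozenset((x + shift - 1) % modulus + 1 for x in base)
            )
    return blocks


sts13 = develop([(1, 2, 5), (1, 3, 9)], 13)

# Count how often each pair of elements occurs together in a block.
pair_counts = {}
for block in sts13:
    for pair in combinations(sorted(block), 2):
        pair_counts[pair] = pair_counts.get(pair, 0) + 1

print(len(sts13), len(pair_counts), set(pair_counts.values()))
```

There are 26 blocks, and each of the 78 pairs of elements occurs exactly once, as required for a (13, 3, 1) design.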


Further progress on designs is assisted by the representation of designs by 
matrices. 


Definition 9.3 


The incidence matrix of a (v, k, λ) design is the b × v matrix A = (a_{ij}) defined
by

    a_{ij} = 1 if the ith block contains the jth element,
    a_{ij} = 0 otherwise.




For example, the incidence matrix of the seven-point plane of Figure 9.1, with 
blocks {1, 2, 4}, {2,3,5},...,{7, 1,3} is 


1101000 
0110100 
0011010 
0001101 (9.2) 
1000110 
0100011 
1010001 


The first row has 1 in positions 1, 2, 4 since the first block is {1, 2, 4}. Note
that the rows correspond to the blocks and the columns correspond to the
elements. Note too that the matrix A depends on the order in which the
blocks and the elements are taken. However, it turns out that the important
properties of A do not depend on the particular orders chosen.


Theorem 9.3 


If A is the incidence matrix of a (v, k, λ) design, then

    A^T A = (r − λ)I + λJ (9.3)

where r is as in (9.1), and where J is the v × v matrix with every entry equal
to 1.


Proof 


The (i, j)th entry of A^T A is the scalar product of the ith row of A^T and the jth
column of A, i.e. of the ith and jth columns of A. Thus the (i, i) diagonal entry
is the scalar product of the ith column of A with itself, and so is the number
of 1s in the ith column. But this is just the number of blocks containing the
ith element, which is r.

If i ≠ j, the scalar product of the ith and jth columns is just the number of
places in which both columns have a 1. This is the number of blocks containing
both the ith and jth elements, which is λ. So all diagonal entries of A^T A are
r, and all non-diagonal entries are λ.
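
For the seven-point plane, where r = 3 and λ = 1, the identity (9.3) can be checked numerically. The sketch below uses plain Python lists; the helper names `transpose` and `matmul` are introduced here for illustration.

```python
# Blocks of the seven-point plane, in the order used for matrix (9.2).
BLOCKS = [{1, 2, 4}, {2, 3, 5}, {3, 4, 6}, {4, 5, 7},
          {5, 6, 1}, {6, 7, 2}, {7, 1, 3}]
V = 7

# Incidence matrix: rows are blocks, columns are elements.
A = [[1 if j in block else 0 for j in range(1, V + 1)] for block in BLOCKS]


def transpose(M):
    return [list(col) for col in zip(*M)]


def matmul(P, Q):
    return [[sum(p * q for p, q in zip(row, col)) for col in zip(*Q)]
            for row in P]


AtA = matmul(transpose(A), A)
r, lam = 3, 1
# (r − λ)I + λJ has r on the diagonal and λ everywhere else.
expected = [[r if i == j else lam for j in range(V)] for i in range(V)]

for row in AtA:
    print(*row)
print(AtA == expected)
```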

An important consequence of this result is the fact that a (v, k, λ) design
cannot contain fewer blocks than elements. This result was first obtained by
the statistician R.A. Fisher in 1940.


Theorem 9.4
In any (v, k, λ) design, b ≥ v.


Proof 


We give a matrix proof, using Theorem 9.3 and basic properties of determinants
(a purely combinatorial proof is outlined in Exercise 9.21).
Let A be the incidence matrix of the design. Then, denoting the determinant
of a matrix M by |M|, we have


              | r  λ  λ  ...  λ |     |  r    λ    λ   ...   λ  |
              | λ  r  λ  ...  λ |     | λ−r  r−λ   0   ...   0  |
    |A^T A| = | λ  λ  r  ...  λ |  =  | λ−r   0   r−λ  ...   0  |
              | .  .  .       . |     |  .    .    .         .  |
              | λ  λ  λ  ...  r |     | λ−r   0    0   ...  r−λ |


on subtracting the first row from each of the other rows. We now add to the
first column the sum of all the other columns to obtain


              | r+(v−1)λ   λ    λ   ...   λ  |
              |    0      r−λ   0   ...   0  |
    |A^T A| = |    0       0   r−λ  ...   0  |
              |    .       .    .         .  |
              |    0       0    0   ...  r−λ |

            = {r + (v − 1)λ}(r − λ)^{v−1}

            = rk(r − λ)^{v−1}


since, by (9.1), r + (v − 1)λ = r + r(k − 1) = rk. Now k < v so, by (9.1), r > λ,
and so |A^T A| ≠ 0. Now A^T A is a v × v matrix and so its rank ρ(A^T A) must
be v. But ρ(A^T A) ≤ ρ(A), where ρ(A) ≤ number of rows of A; so ρ(A^T A) ≤ b.
Thus v ≤ b as required.


Example 9.7 


We can show that no (25, 10, 3) design can exist. If it did, then (9.1) would give 
72 = 9r and 10b = 25r, whence r = 8 and b = 20. But this gives b < v. 


A (v, k, λ) design with b = v is called a symmetric design. Note that if b = v
it follows from (9.1) that r = k and

    λ(v − 1) = k(k − 1). (9.4)

Also, (9.3) reduces to

    A^T A = (k − λ)I + λJ. (9.5)


A symmetric design is not so named because its incidence matrix is symmetric 
- usually it is not! - but because of a symmetry between some of the properties 
of the blocks and the elements. In a symmetric design we have, since r = k, 


every block contains k elements; 
every element is in k blocks. 
We also have

every pair of elements is in λ blocks

and we are now going to establish that

every pair of blocks intersect in λ elements.


Theorem 9.5 


If A is the incidence matrix of a symmetric design then AA^T = A^T A.


Proof 


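Since AJ = kJ and JA = rJ, it follows from r = k that AJ = JA. Thus
A commutes with J, and so A commutes with (k − λ)I + λJ = A^T A. Since
|A|^2 = |A^T A| ≠ 0 (see the proof of Theorem 9.4), A is invertible. Thus
AA^T = A(A^T A)A^{−1} = A{(k − λ)I + λJ}A^{−1} = {(k − λ)I + λJ}AA^{−1} = (k − λ)I + λJ = A^T A.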


Corollary 9.6 


In a symmetric (v, k, λ) design, every pair of blocks intersect in λ elements.


Proof 


The (i, j)th entry of AA^T is the product of the ith row of A and the jth
column of A^T, i.e. the product of the ith and jth rows of A. But this is just the
number of columns in which the ith and jth rows both have 1, i.e. the number
of elements in both the ith and jth blocks. By Theorem 9.5 this number is λ
whenever i ≠ j.


One consequence of these symmetry properties is that if we take the incidence
matrix A of a symmetric design and interchange rows and columns (to obtain
A^T), then A^T is also the incidence matrix of a symmetric design, called the dual
design. For example, the transpose of the incidence matrix of the seven-point
plane of Figure 9.1 is


    1000101
    1100010
    0110001
    1011000
    0101100
    0010110
    0001011


which is the incidence matrix of the seven-point plane with blocks {1, 5, 7},
{1, 2, 6}, {2, 3, 7}, {1, 3, 4}, {2, 4, 5}, {3, 5, 6}, {4, 6, 7}. (Note however that the
relabelling 1 → 7, 2 → 6, ..., 7 → 1 reveals that this second plane is in fact the
original one in disguise!)
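
These statements about the dual of the seven-point plane can be checked directly. The following is an illustrative sketch; the variable names are assumptions made here.

```python
# Blocks of the seven-point plane, in the order used for matrix (9.2).
BLOCKS = [{1, 2, 4}, {2, 3, 5}, {3, 4, 6}, {4, 5, 7},
          {5, 6, 1}, {6, 7, 2}, {7, 1, 3}]

A = [[1 if j in block else 0 for j in range(1, 8)] for block in BLOCKS]
At = [list(col) for col in zip(*A)]

# Blocks of the dual design: read each row of the transpose as a block.
dual_blocks = [{j + 1 for j, entry in enumerate(row) if entry} for row in At]

# Any two distinct blocks of the original design meet in λ = 1 element.
intersections = {len(BLOCKS[i] & BLOCKS[j])
                 for i in range(7) for j in range(i + 1, 7)}

# Apply the relabelling x -> 8 − x to the dual blocks.
relabelled = {frozenset(8 - x for x in block) for block in dual_blocks}
original = {frozenset(block) for block in BLOCKS}

print(dual_blocks)
print(intersections, relabelled == original)
```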


Complementary designs


Given a (v, k, λ) design D, we can obtain another design D̄ from it in which the
blocks are the complements of the blocks of D. D̄ is called the complementary
design of D. In Example 9.1, the blocks of size 4 form such a design which is
complementary to the seven-point plane.


Theorem 9.7 
Let D be a (v, k, λ) design on a set S, with blocks B_1, ..., B_b. Then the sets
B̄_i = S \ B_i form a (v, v − k, λ') design where λ' = b − 2r + λ, provided λ' > 0.


Proof 


Since |B_i| = k for each i, it is clear that |B̄_i| = v − k for each i. We have to
show that every pair of elements of S lie in exactly λ' of the blocks B̄_i. Now if
x, y ∈ S, then x, y ∈ B̄_i precisely when neither x nor y belongs to B_i. But by
the inclusion-exclusion principle, the number of blocks B_i containing neither x
nor y is


    b − (no. of blocks containing x) − (no. of blocks containing y)
      + (no. of blocks containing both x and y)
    = b − 2r + λ.


Example 9.8 


The complementary design to the seven-point plane is a (7, 4, 2) design since
λ' = b − 2r + λ = 7 − 6 + 1 = 2. The blocks of this design are the groups of size
4 in Example 9.1.
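
The calculation in Example 9.8 can be checked by brute force. The following is a minimal sketch, assuming the seven-point plane is taken as the translates of {1, 2, 4} modulo 7; the helper names `complement_design` and `pair_count` are illustrative and do not come from the text.

```python
def complement_design(points, blocks):
    """Return the complementary blocks (all points not in B) for each block B."""
    points = set(points)
    return [sorted(points - set(b)) for b in blocks]


def pair_count(blocks, x, y):
    """Number of blocks containing both x and y."""
    return sum(1 for b in blocks if x in b and y in b)


# Seven-point plane: translates of {1, 2, 4} modulo 7, using the labels 1..7.
plane = [sorted(((d - 1 + i) % 7) + 1 for d in (1, 2, 4)) for i in range(7)]
comp = complement_design(range(1, 8), plane)

b, r, lam = 7, 3, 1
lam_comp = b - 2 * r + lam  # the value of lambda' predicted by Theorem 9.7

for block in comp:
    print(block)
```

Every pair of points should then lie in exactly `lam_comp` of the complementary blocks.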


Theorem 9.8 


The complement of a symmetric design is also symmetric. 


Proof 
If D is symmetric, b = v, and so D̄ also has b = v blocks.


9.2 Resolvable Designs 


Example 9.9 (The Kirkman schoolgirls problem)


In 1850, Kirkman posed the following problem: “Fifteen young ladies in a school
walk out three abreast for seven days in succession: it is required to arrange
them daily, so that no two shall walk twice abreast”.

If such an arrangement exists, the triples of girls will form a STS(15). According
to the proof of Theorem 9.2, any such design has b = (1/6)·15·14 = 35 blocks;
the problem is to group these blocks into seven groups of 5 blocks, the blocks
in each group forming a partition of the set of 15 schoolgirls. This requirement
is similar to that for the games in a league schedule for 2n teams, where the
games (pairs) are arranged into 2n − 1 groups of n games, the pairs in each
group forming a partition of the set of 2n teams.


Definition 9.4 


A (v, k, λ) design on a set S is resolvable if the blocks can be arranged into r
groups so that each group forms a partition of S. The groups are then called
the resolution or parallel classes.


(Note why there have to be r groups: each element occurs in r blocks, and 
has to occur in precisely one block of each group. Note too that a resolvable 
design can only exist when k|v.) 


Here, for example, is a solution to the Kirkman schoolgirls problem. In this 
resolvable (15, 3, 1) design the blocks are grouped into seven groups of 5 blocks, 
each group partitioning {1,... , 15}. Read the groups horizontally. 


1,8,15    2,4,10   3,7,12   5,6,9    11,13,14
2,9,15    3,5,11   4,1,13   6,7,10   12,14,8
3,10,15   4,6,12   5,2,14   7,1,11   13,8,9
4,11,15   5,7,13   6,3,8    1,2,12   14,9,10
5,12,15   6,1,14   7,4,9    2,3,13   8,10,11
6,13,15   7,2,8    1,5,10   3,4,14   9,11,12
7,14,15   1,3,9    2,6,11   4,5,8    10,12,13.
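
The two requirements on this arrangement (each day partitions the 15 girls, and no pair walks together twice) are easy to test mechanically. Here is a small sketch of such a check; the function name `is_kirkman_solution` is our own, not the book's.

```python
from itertools import combinations

# The seven days of the solution above, five triples per day.
days = [
    [(1, 8, 15), (2, 4, 10), (3, 7, 12), (5, 6, 9), (11, 13, 14)],
    [(2, 9, 15), (3, 5, 11), (4, 1, 13), (6, 7, 10), (12, 14, 8)],
    [(3, 10, 15), (4, 6, 12), (5, 2, 14), (7, 1, 11), (13, 8, 9)],
    [(4, 11, 15), (5, 7, 13), (6, 3, 8), (1, 2, 12), (14, 9, 10)],
    [(5, 12, 15), (6, 1, 14), (7, 4, 9), (2, 3, 13), (8, 10, 11)],
    [(6, 13, 15), (7, 2, 8), (1, 5, 10), (3, 4, 14), (9, 11, 12)],
    [(7, 14, 15), (1, 3, 9), (2, 6, 11), (4, 5, 8), (10, 12, 13)],
]


def is_kirkman_solution(days, v=15):
    """Check that each day partitions 1..v and every pair occurs exactly once."""
    points = list(range(1, v + 1))
    for day in days:
        if sorted(x for block in day for x in block) != points:
            return False  # this day is not a partition of the point set
    seen = set()
    for day in days:
        for block in day:
            for pair in combinations(sorted(block), 2):
                if pair in seen:
                    return False  # two girls walk together twice
                seen.add(pair)
    return len(seen) == v * (v - 1) // 2


print(is_kirkman_solution(days))
```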


Affine planes were constructed in Section 7.4. Starting with a complete set of
MOLS of order n, we constructed an affine plane with n² points and n² + n
lines. We also saw that the lines could be grouped into n + 1 groups of n
parallel lines: in other words, the affine plane so obtained was resolvable. We
now show that all (n², n, 1) designs, no matter how they are obtained, must be
resolvable.


Theorem 9.9 


Every (n², n, 1) design is resolvable.


Proof 


From (9.1), we have r = n + 1 and b = n² + n. We first show that, given
any block B = {b_1, ..., b_n} and any element x ∉ B, there is a unique block
containing x which does not intersect B. (Compare this with the following:
given a line ℓ and a point P not on ℓ, there is a unique line through P which
does not meet ℓ (i.e. which is parallel to ℓ).)

For each b_i ∈ B, there is a unique block B_i containing both b_i and x. Clearly
B_i ≠ B_j whenever i ≠ j, for otherwise b_i and b_j would be contained in both B
and B_i, contradicting the fact that λ = 1. So we get n blocks B_i containing x
and intersecting B. But altogether there are r = n + 1 blocks containing x; so
there must be exactly one block containing x and disjoint from B.

Next we note that if C_1 and C_2 are any two distinct blocks disjoint from B then C_1
and C_2 do not intersect. For suppose x ∈ C_1 ∩ C_2; then x would be in more
than one block which is disjoint from B, contrary to what we have just proved.

So consider the n² − n elements not in B. For each such x, there is a unique
block containing x and disjoint from B. Since each block has size n, we therefore
have (n² − n)/n = n − 1 blocks disjoint from B and disjoint from one another,
thus forming a resolution class with B. So each block is contained in a resolution
class of n blocks, and so the design is resolvable, two blocks being in the same
class if and only if they are disjoint.


We can now use resolvability to get back from affine planes to MOLS. 


Theorem 9.10 


An affine plane of order n exists ⇔ a complete set of n − 1 MOLS of order n
exists.


Proof 


The proof of ⇐ was given in Section 7.4, so we now consider the reverse
implication. Suppose that a (n², n, 1) design exists. It is necessarily resolvable, and
it has r = n + 1 resolution classes. Select two such classes, say {B_1, ..., B_n}
and {C_1, ..., C_n}. Since each point of the plane lies in exactly one B_i and one
C_j, we can give it unique “coordinates” (i, j). (Distinct points receive distinct
coordinates, since B_i and C_j have at most one point in common when λ = 1.)

There remain n − 1 other resolution classes, and we construct a latin square
for each. For the resolution class {E_1, ..., E_n}, define an n × n matrix E = (e_ij)
by


e_ij = k where the point (i, j) lies in E_k.


First we check that E is a latin square. If e_ij = e_iJ (= k) with j ≠ J then (i, j) and
(i, J) lie in E_k but also in B_i, contradicting λ = 1. So no row of E has repeated
elements, and a similar argument applies to columns.

Finally we check that if {F_1, ..., F_n} is another resolution class then E and
F are orthogonal. So suppose e_ij = e_IJ (= k) and f_ij = f_IJ (= l) with (i, j) ≠ (I, J).
Then (i, j) and (I, J) both lie in E_k and also in F_l, again contradicting λ = 1.

So we obtain n − 1 mutually orthogonal latin squares, as required.


Example 9.10
Take the affine plane constructed in Example 7.10. Take


B_1 = {1, 2, 3}, B_2 = {4, 5, 6}, B_3 = {7, 8, 9},
C_1 = {1, 4, 7}, C_2 = {2, 5, 8}, C_3 = {3, 6, 9},
E_1 = {1, 6, 8}, E_2 = {2, 4, 9}, E_3 = {3, 5, 7},
F_1 = {1, 5, 9}, F_2 = {2, 6, 7}, F_3 = {3, 4, 8}.

Then, for example, the point (1, 1) is 1, (1, 2) is 2, (2, 3) is 6. Since 1 ∈ E_1, e_11 =
1; since 2 ∈ E_2, e_12 = 2; since 6 ∈ E_1, e_23 = 1. In this way we obtain

    [ 1 2 3 ]           [ 1 2 3 ]
E = [ 2 3 1 ]  and  F = [ 3 1 2 ].
    [ 3 1 2 ]           [ 2 3 1 ]
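
The construction in the proof of Theorem 9.10 can be carried out mechanically on the data of Example 9.10. The sketch below is illustrative only: the names `square_from_class`, `is_latin` and `are_orthogonal` are our own, and the classes are simply typed in from the example.

```python
B = [{1, 2, 3}, {4, 5, 6}, {7, 8, 9}]
C = [{1, 4, 7}, {2, 5, 8}, {3, 6, 9}]
E_classes = [{1, 6, 8}, {2, 4, 9}, {3, 5, 7}]
F_classes = [{1, 5, 9}, {2, 6, 7}, {3, 4, 8}]


def square_from_class(B, C, cls):
    """Entry (i, j) is the index k of the block of cls containing the point of B_i and C_j."""
    n = len(B)
    square = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            (point,) = B[i] & C[j]  # the unique point with coordinates (i, j)
            square[i][j] = next(k + 1 for k, blk in enumerate(cls) if point in blk)
    return square


def is_latin(sq):
    """Every row and every column contains each of 1..n exactly once."""
    n = len(sq)
    symbols = set(range(1, n + 1))
    rows_ok = all(set(row) == symbols for row in sq)
    cols_ok = all({sq[i][j] for i in range(n)} == symbols for j in range(n))
    return rows_ok and cols_ok


def are_orthogonal(s, t):
    """All n*n ordered pairs (s_ij, t_ij) are distinct."""
    n = len(s)
    return len({(s[i][j], t[i][j]) for i in range(n) for j in range(n)}) == n * n


E = square_from_class(B, C, E_classes)
F = square_from_class(B, C, F_classes)
print(E, F, are_orthogonal(E, F))
```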


9.3 Finite Projective Planes 


In searching for symmetric designs it is natural to look for such designs with
λ = 1. The seven-point plane is one such design. In general, if we have such a design
with k = n + 1 and λ = 1 then λ(v − 1) = k(k − 1) gives v − 1 = n² + n, so that
the design is a (n² + n + 1, n + 1, 1) design. Conversely, any (n² + n + 1, n + 1, 1)
design must, by (9.1), have n² + n = rn and b(n + 1) = (n² + n + 1)r, so that
r = n + 1 = k and b = n² + n + 1 = v; so the design must be symmetric.


Definition 9.5

A finite projective plane (FPP) of order n is a (n² + n + 1, n + 1, 1) design.


Thus in a FPP there are equal numbers of blocks and elements. They mimic the 
lines and points of a geometry: any two blocks (lines) intersect in one element 
(point), and any two points lie on a unique line. The seven-point plane is a 
FPP of order 2. 


Example 9.11 


The blocks {1, 2, 4, 10}, {2, 3, 5, 11}, {3, 4, 6, 12}, {4, 5, 7, 13}, {5, 6, 8, 1}, ...,
{10, 11, 13, 6}, {11, 12, 1, 7}, {12, 13, 2, 8}, {13, 1, 3, 9} are the lines of a FPP of
order 3, i.e. the blocks form a (13, 4, 1) design. Note the cyclic nature of the
design (modulo 13). It is now known that such a cyclic FPP of order p exists
for all primes p (modulo p² + p + 1).


We now show the fundamental connection between projective and affine planes.
An artist does not draw parallel lines as parallel; the two sides of a road converge
as the road disappears into the distance. The artist is using projective geometry
to represent euclidean (affine) geometry. So let us follow this idea, and, for each
of the n + 1 resolution classes of an affine plane of order n, add a “point at
infinity” to each block of that class. Thus, for each i ≤ n + 1, we add a new
point ∞_i to each block of the ith class. The blocks now have size n + 1, there
are n² + n + 1 points, and it should be clear that the property of any two points
lying on a unique line is preserved, except that there is no line containing any
two of the new points ∞_i. So if we introduce one new line {∞_1, ..., ∞_{n+1}}
we have a (n² + n + 1, n + 1, 1) design, i.e. a FPP of order n. The line thus
introduced is sometimes called the line at infinity.


Example 9.12 


Take the affine plane of order 3 used in Example 9.10, and introduce four 
“points at infinity”, which we denote by 10,11,12,13. We obtain a FPP of 
order 3 with the following lines: 


{1,2, 3, 10} {4,5,6,10} {7,8,9,10} 
{1,4,7, 11} {2,5,8,11} {3,6,9,11} 
{1, 6, 8, 12} {2,4,9,12} {3,5,7,12} 
{1,5, 9,13} {2,6,7,13} {3,4,8, 13} 
{10, 11, 12, 13}. 
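
The passage from the affine plane to the projective plane in Example 9.12 can be sketched in a few lines. This is an illustration under the example's own labelling (points at infinity numbered n² + 1, ..., n² + n + 1); the function names are hypothetical.

```python
from itertools import combinations

# Resolution classes of the affine plane of order 3 used in Example 9.10.
classes = [
    [{1, 2, 3}, {4, 5, 6}, {7, 8, 9}],
    [{1, 4, 7}, {2, 5, 8}, {3, 6, 9}],
    [{1, 6, 8}, {2, 4, 9}, {3, 5, 7}],
    [{1, 5, 9}, {2, 6, 7}, {3, 4, 8}],
]


def projective_from_affine(classes):
    """Add one new point to every block of each class, then add the line at infinity."""
    n = len(classes) - 1
    first_new = n * n + 1
    lines = []
    for i, cls in enumerate(classes):
        infinity = first_new + i  # the point at infinity for the ith class
        lines.extend(blk | {infinity} for blk in cls)
    lines.append(set(range(first_new, first_new + n + 1)))  # line at infinity
    return lines


def is_design(blocks, v, k, lam):
    """Check block size k and that each pair from 1..v lies in exactly lam blocks."""
    if any(len(b) != k for b in blocks):
        return False
    return all(
        sum(1 for b in blocks if x in b and y in b) == lam
        for x, y in combinations(range(1, v + 1), 2)
    )


lines = projective_from_affine(classes)
print(len(lines), is_design(lines, 13, 4, 1))
```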


We have thus established one half of the following theorem. 


Theorem 9.11 


There exists a FPP of order n ⇔ there exists an affine plane of order n.


Proof 


We have dealt with ⇐ above. So now suppose there exists a FPP of order n.
Take any line ℓ = {p_1, ..., p_{n+1}} of the plane. (We are going to treat it as if
it were the line at infinity.) Since ℓ intersects every other line in exactly one
point, each other line contains exactly one of the p_i. So if we throw away ℓ and
remove its points wherever they occur, we end up with n² + n lines each with
n points. Any two points lie in exactly one line, namely the remnant of the line
of the FPP on which they lay. So we have an affine plane of order n, in which,
for each i, the remnants of the lines of the FPP containing p_i form a resolution
class.


Combining Theorems 9.10 and 9.11 together, we obtain the following remark- 
able result. 


Theorem 9.12 


The following statements are equivalent.
(i) There exists a complete set of n − 1 MOLS of order n.
(ii) There exists an affine plane of order n.
(iii) There exists a finite projective plane of order n.


In view of Theorem 7.2 and the note following, FPPs of prime power order
exist. It is conjectured that if n is not a prime power then no FPP of order n
exists. This has been confirmed for n = 6 and 10, but the case n = 12 is still
open.


9.4 Hadamard Matrices and Designs 


We begin this section by introducing a family of matrices all of whose entries
are ±1. From these matrices an important family of designs will be obtained.

Two vectors will be called orthogonal if their scalar product is 0. For ex- 
ample, any two row vectors in the matrix 


1  1  1  1
1 −1  1 −1
1  1 −1 −1          (9.6)
1 −1 −1  1


are orthogonal, as are any two columns. 


Definition 9.6 


A (+1, −1) matrix is a matrix all of whose entries are ±1. An n × n (+1, −1)
matrix H is a Hadamard matrix of order n if HH^T = H^T H = nI.


Note that HH^T = nI is essentially saying that H is invertible with inverse
(1/n)H^T. Since a matrix commutes with its inverse, either of the properties HH^T =
nI and H^T H = nI follows from the other. Note too that HH^T = nI is precisely
the property that any two distinct rows of H are orthogonal, and similarly H^T H = nI
is equivalent to demanding that the columns of H are orthogonal. Hadamard
matrices are so named after Jacques Hadamard who, in 1893, showed that any
real n × n matrix H, whose entries h_ij all satisfy |h_ij| ≤ 1, has determinant
at most n^(n/2), equality occurring only if HH^T = nI. But since then Hadamard
matrices have cropped up in many areas of combinatorics, and have even been
behind the sending of photographs from Mars to Earth. This will be explained
in the final section.

There is a straightforward way of constructing Hadamard matrices of order
2^m. Let

H_1 = [ 1  1 ]
      [ 1 −1 ]

and, for each m > 1, define H_m inductively by

H_m = [ H_{m−1}   H_{m−1} ]
      [ H_{m−1}  −H_{m−1} ].

Then it is straightforward to check that H_m is a Hadamard matrix of order
2^m; consider the scalar product of any two of its rows. Thus, for example, H_2
is just the matrix (9.6).
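
The doubling construction translates directly into code. The following sketch builds H_m as a list of rows and tests the defining property row by row; the names `sylvester` and `is_hadamard` are our own labels and are not used in the text.

```python
def sylvester(m):
    """Return H_m (order 2**m, m >= 1) as a list of rows with entries +1 and -1."""
    H = [[1, 1], [1, -1]]  # H_1
    for _ in range(m - 1):
        top = [row + row for row in H]                    # [H   H]
        bottom = [row + [-x for x in row] for row in H]   # [H  -H]
        H = top + bottom
    return H


def is_hadamard(H):
    """Check that distinct rows are orthogonal and each row has scalar square n."""
    n = len(H)
    return all(
        sum(a * b for a, b in zip(H[i], H[j])) == (n if i == j else 0)
        for i in range(n)
        for j in range(n)
    )


for row in sylvester(2):
    print(row)
print(is_hadamard(sylvester(3)))
```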

But for other values of n does a Hadamard matrix of order n exist? 

We shall show that if n > 2 then n has to be a multiple of 4. Before proving
this, it is helpful to note that if we have a Hadamard matrix and we multiply any
row or column by −1 then the resulting matrix is still Hadamard; orthogonality
of the rows and columns is preserved. So we can always normalise a Hadamard
matrix, i.e. make every entry in the first row and the first column equal to +1.


Theorem 9.13 


If there exists a Hadamard matrix H of order n > 2, then n must be a multiple
of 4.


Proof 


Assume that H has been normalised, so that the first row has +1 in each
position. Since the rows are orthogonal, the second row must have the same
number of +1s and −1s; so there must be n/2 +1s and n/2 −1s, so that n is
necessarily even. By rearranging the order of the columns, we can suppose that
H has first two rows

1  1 ...  1   1  1 ...  1
1  1 ...  1  −1 −1 ... −1

If n > 2, consider the third row of H. Suppose that h of its first n/2 entries are
+1, and k of its second n/2 entries are +1. Then n/2 − h of its first n/2 entries are
−1, and n/2 − k of its second n/2 entries are −1. Since the first and third rows are
orthogonal,

h − (n/2 − h) + k − (n/2 − k) = 0,

i.e. h + k = n/2. Also, since the second and third rows are orthogonal,

h − (n/2 − h) − k + (n/2 − k) = 0,

i.e. h = k. Thus h = k = n/4, and n has to be a multiple of 4.


Since a similar argument can be applied to the columns, we have: 


Corollary 9.14 


In any normalised Hadamard matrix of order 4m, any two columns other than
the first have +1s together in exactly m places.


Hadamard designs


If a Hadamard matrix of order 4m exists, we can obtain a design from it.
The basic idea is to take a normalised matrix, remove its first row and column,
replace each −1 by 0, and interpret the resulting matrix as the incidence matrix
of a design.


Theorem 9.15
If a Hadamard matrix of order 4m exists, then a (4m − 1, 2m − 1, m − 1) design
exists.


Proof 


Let H be a normalised Hadamard matrix of order 4m. As in the proof of
Theorem 9.13, each row and column of H, apart from the first, must have
2m +1s and 2m −1s. Remove the first row and column of H, and replace each
−1 in the resulting matrix by 0. This results in a (4m − 1) × (4m − 1) (0, 1)-matrix
A in which every row and every column has 2m 0s and 2m − 1 1s.

We interpret A as the incidence matrix of a (necessarily symmetric) design.
It has 4m − 1 blocks and 4m − 1 elements; each block contains 2m − 1 elements
and each element lies in 2m − 1 blocks. Consider now any two elements. The
columns of A corresponding to these two elements have (by Corollary 9.14)
m − 1 1s together, i.e. these two elements occur together in exactly m − 1
blocks. So the design is balanced with λ = m − 1.


Definition 9.7 
A (4m —1,2m — 1,m — 1) design is called a Hadamard design. 


Example 9.13 


The doubling construction gives, from (9.6), a normalised Hadamard matrix of 
order 8. Removing the first row and column and changing every —1 to 0 gives 
the following matrix: 


0101010 
1001100 
0011001 
1110000 (9.7) 
0100101 
1000011 
0010110 


This is the incidence matrix of a (7,3, 1) design, i.e. a FPP of order 2, whose 
blocks are: 


{2, 4, 6}, {1, 4, 5}, {3, 4, 7}, {1, 2, 3}, {2, 5, 7}, {1, 6, 7}, {3, 5, 6}.

There is, in fact, essentially only one FPP of order 2. The plane just found
can be “converted” into the plane of Figure 9.1 by replacing 1, 2, 3, 4, 5, 6, 7 by
1, 6, 5, 4, 2, 3, 7 respectively.
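
Example 9.13 and the relabelling just described can be reproduced as follows. This is a sketch only: it assumes the order 8 matrix is the one produced by the doubling construction, takes the plane of Figure 9.1 to be the translates of {1, 2, 4} modulo 7, and uses helper names of our own choosing.

```python
def sylvester(m):
    """Doubling construction: H_m of order 2**m with entries +1 and -1."""
    H = [[1, 1], [1, -1]]
    for _ in range(m - 1):
        H = [row + row for row in H] + [row + [-x for x in row] for row in H]
    return H


def hadamard_design_blocks(H):
    """Drop the first row and column of a normalised H; block i lists the columns where row i is +1."""
    n = len(H)
    return [{j for j in range(1, n) if row[j] == 1} for row in H[1:]]


blocks = hadamard_design_blocks(sylvester(3))

# The relabelling 1,2,3,4,5,6,7 -> 1,6,5,4,2,3,7 given in the text.
relabel = {1: 1, 2: 6, 3: 5, 4: 4, 5: 2, 6: 3, 7: 7}
relabelled = {frozenset(relabel[x] for x in blk) for blk in blocks}

# Assumed form of the seven-point plane: translates of {1, 2, 4} modulo 7.
plane = {frozenset(((d - 1 + i) % 7) + 1 for d in (1, 2, 4)) for i in range(7)}

print(blocks)
print(relabelled == plane)
```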

The construction presented in the proof of Theorem 9.15 can be reversed: 
given a Hadamard design, replace each 0 in the incidence matrix by —1 and add 
a first row and column with all entries +1. The resulting matrix is Hadamard. 
So we have the following result. 


Theorem 9.16 


A Hadamard matrix of order 4m exists ⇔ a Hadamard (4m − 1, 2m − 1, m − 1)
design exists.


This result enables us to construct Hadamard matrices of order 4m provided 
that we can construct the corresponding design. Many methods of constructing 
Hadamard designs have been found, but we give only the simplest of these 
constructions. 

Let p be any prime of the form p = 4m − 1, and take the squares of
1, 2, ..., (p − 1)/2 (mod p). For example, if p = 7, the squares of 1, 2, 3 are
1, 4, 2 (mod 7) since 9 ≡ 2 (mod 7). So we obtain the set {1, 2, 4} which we
observe is precisely the set used to generate the seven-point plane in Example 9.1.
In general, starting with any prime p = 4m − 1, the set of squares
of 1, 2, ..., (p − 1)/2 (mod p) acts as a starting block for a cyclic Hadamard
(4m − 1, 2m − 1, m − 1) design. This will be justified in the next section
(Theorem 9.18). See the appendix for details of squares (mod p).


Example 9.14 


Take p = 11. The squares of 1, ..., 5 (mod 11) are 1, 4, 9, 5, 3. So take the
set {1, 3, 4, 5, 9} as the starting block and obtain other blocks by successively
adding 1 (mod 11) to each entry of the block. The resulting (11, 5, 2) design has
incidence matrix


10111000100
01011100010
00101110001
10010111000
01001011100
00100101110          (9.8)
00010010111
10001001011
11000100101
11100010010
01110001001


and from this we obtain the following Hadamard matrix of order 12 (where
+, - stand for +1, −1 respectively)


+ + + + + + + + + + + +
+ + - + + + - - - + - -
+ - + - + + + - - - + -
+ - - + - + + + - - - +
+ + - - + - + + + - - -
+ - + - - + - + + + - -
+ - - + - - + - + + + -
+ - - - + - - + - + + +
+ + - - - + - - + - + +
+ + + - - - + - - + - +
+ + + + - - - + - - + -
+ - + + + - - - + - - +
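
The two steps of Example 9.14 (squares modulo p as a starting block, then bordering the incidence matrix) can be sketched as below. The function name `hadamard_from_squares` is hypothetical, and the code assumes without checking that p is a prime of the form 4m − 1; columns are labelled 1, ..., p with p playing the role of 0, as in (9.8).

```python
def hadamard_from_squares(p):
    """Return (A, H): the cyclic incidence matrix from the squares mod p, and the bordered matrix."""
    squares = {(x * x) % p for x in range(1, (p - 1) // 2 + 1)}  # starting block
    # Row i is the translate squares + i; column labels run 1..p, with p standing for 0.
    A = [[1 if (col - i) % p in squares else 0 for col in range(1, p + 1)] for i in range(p)]
    # Replace 0 by -1 and add a first row and a first column of +1s.
    H = [[1] * (p + 1)] + [[1] + [1 if a else -1 for a in row] for row in A]
    return A, H


def is_hadamard(H):
    """Check that distinct rows are orthogonal and each row has scalar square n."""
    n = len(H)
    return all(
        sum(a * b for a, b in zip(H[i], H[j])) == (n if i == j else 0)
        for i in range(n)
        for j in range(n)
    )


A, H = hadamard_from_squares(11)
for row in A:
    print("".join(map(str, row)))
print(is_hadamard(H))
```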


It is conjectured that a Hadamard matrix of order 4n exists for all positive
integers n. But this is far from being proved.


9.5 Difference Methods 


The seven-point plane can be constructed by starting with {1, 2, 4} and obtaining
further blocks by adding 1 (mod 7) successively, to obtain {2, 3, 5}, {3, 4, 6},
etc. What is special about the choice of 1, 2, 4 which makes the method
work? Similarly, the block {1, 3, 4, 5, 9} was used in Example 9.14 to obtain
a Hadamard design; what is special about this choice?

Consider the differences between elements of {1, 2, 4} modulo 7. They are
±(2 − 1), ±(4 − 2), ±(4 − 1), i.e. ±1, ±2, ±3, i.e. 1, 2, 3, 4, 5, 6, i.e. all the
non-zero numbers (mod 7), each occurring once. Similarly, consider the differences
between elements of {1, 3, 4, 5, 9} (mod 11); they are ±2, ±3, ±4, ±8, ±1, ±2, ±6,
±1, ±5, ±4, i.e. ±1, ±2, ±3, ±4, ±5 twice, i.e. all the non-zero numbers (mod 11)
each occurring twice. Note that the first gives rise to a design with λ = 1, and
the second gives rise to a design with λ = 2.


Definition 9.8 


(i) Let Z_v denote the integers modulo v. A k-element subset D = {d_1, ..., d_k}
of Z_v is called a cyclic (v, k, λ) difference set if λ > 0, 2 ≤ k < v, and
every non-zero d ∈ Z_v can be expressed as d = d_i − d_j in exactly λ ways.

(ii) If D is a difference set, the set D + a = {d_1 + a, ..., d_k + a} is called a
translate of D.


Thus the seven-point plane of Example 9.1 consists of the translates of {1, 2, 4}.
This is a special case of the following general result.


Theorem 9.17


If D = {d_1, ..., d_k} is a cyclic (v, k, λ) difference set then the translates
D + i, 0 ≤ i ≤ v − 1, are the blocks of a symmetric (v, k, λ) design.


Proof


Here D + i = {d_1 + i, ..., d_k + i}. Clearly there are v translates, each of size
k. So we have only to check the balance property. Two elements x, y are in the
same translate D + a if and only if x = a + d_i, y = a + d_j for some i ≠ j, i.e.
x − a = d_i, y − a = d_j, i.e. (x − a, y − a) is one of the λ pairs (d_i, d_j) such that
d_i − d_j = x − y. Each such pair determines a, so x and y lie together in exactly
λ translates.


We can now see why the “starter” block {1, 2, 4, 10} in Example 9.11 gives rise
to a FPP of order 3. The differences are ±1, ±3, ±9, ±2, ±8, ±6, i.e. all the
non-zero numbers (mod 13), each once, so {1, 2, 4, 10} is a (13, 4, 1) difference
set.
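
Checking the difference property by hand soon becomes tedious, so here is a small sketch that counts differences and develops the translates. The function names are illustrative; the labels 1, ..., v are used for Z_v, with v standing for 0, to match the examples in the text.

```python
from collections import Counter


def difference_counts(D, v):
    """Count how often each residue mod v arises as d_i - d_j with i != j."""
    return Counter((a - b) % v for a in D for b in D if a != b)


def is_difference_set(D, v, lam):
    """True if every non-zero residue mod v arises exactly lam times as a difference."""
    counts = difference_counts(D, v)
    return all(counts[d] == lam for d in range(1, v))


def translates(D, v):
    """Blocks D + i for 0 <= i <= v - 1, written with the labels 1..v."""
    return [sorted(((d + i - 1) % v) + 1 for d in D) for i in range(v)]


print(is_difference_set([1, 2, 4, 10], 13, 1))
for block in translates([1, 2, 4, 10], 13):
    print(block)
```

The same check can be applied to any candidate starting block before it is developed into a design.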


Example 9.15 


(i) {1, 2, 5, 15, 17} is a (21, 5, 1) difference set in Z_21, and its translates form a
FPP of order 4.


(ii) {1, 2,7, 19, 23, 30} is a (31,6, 1) difference set, leading to a FPP of order 5. 


(iii) {1,3,4,5,9} is a (11,5, 2) difference set and its translates form a (11,5, 2) 
design as in Example 9.14. 


(iv) The set {2,6,7,8, 10,11}, which is the complement of the difference set 
of (iii), is itself a (11,6,3) difference set, the translates of which are the 
blocks of a (11,6, 3) design. This design is the complementary design (The- 
orem 9.7) of the Hadamard (11, 5,2) design of Example 9.14. 


It is now known that a (p² + p + 1, p + 1, 1) difference set exists for all primes p,
leading to a cyclic FPP in each case. It cannot be emphasised too strongly how 
useful this difference set technique is in the construction of symmetric designs. 
Of course the method is useful only if difference sets can be constructed. One 
method of construction was mentioned in the previous section in connection 
with Hadamard designs, so we now show why that method of construction 
works. 

The number-theoretic facts required for the proof can be found in the ap- 
pendix. 


Theorem 9.18 


Let p = 4m − 1 be prime. Then the non-zero squares in Z_p form a
(p, (p − 1)/2, (p − 3)/4) difference set.


Proof


Since (−x)² = x², each square arises twice; as shown in the appendix, exactly
half of the non-zero elements of Z_p are squares. Thus there are (p − 1)/2 non-zero
squares.

Let w be any non-zero square, say w = t² (mod p). Then for each representation
1 = x² − y² as a difference of squares, there corresponds the representation
w = (tx)² − (ty)². Conversely, if s is the inverse of t (mod p), then
corresponding to each representation w = x² − y² we have a representation
1 = s²t² = s²w = (sx)² − (sy)². So all non-zero squares w in Z_p have the same
number of representations as a difference of squares.

Further, since p ≡ 3 (mod 4), −1 is not a square in Z_p and the non-squares are
precisely the negatives of the squares. So, corresponding to any representation
z = x² − y² of the non-square z, we have the representation −z = y² − x² of
the square −z. So all non-squares and squares have the same number, say λ,
of representations as a difference of squares. The value of λ is obtained from
λ(v − 1) = k(k − 1): we have λ(p − 1) = (1/2)(p − 1) · (1/2)(p − 3), whence λ = (p − 3)/4.
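
Theorem 9.18 can be tested numerically for small primes. The sketch below is only a check, not a proof; the function names are our own, and the list of primes in the final line is an arbitrary sample of primes congruent to 3 modulo 4.

```python
from collections import Counter


def squares_mod(p):
    """The non-zero squares modulo p, in increasing order."""
    return sorted({(x * x) % p for x in range(1, p)})


def check_theorem_9_18(p):
    """Test that the non-zero squares mod p form a (p, (p-1)/2, (p-3)/4) difference set."""
    Q = squares_mod(p)
    counts = Counter((a - b) % p for a in Q for b in Q if a != b)
    right_size = len(Q) == (p - 1) // 2
    balanced = all(counts[d] == (p - 3) // 4 for d in range(1, p))
    return right_size and balanced


print(squares_mod(11))
print(all(check_theorem_9_18(p) for p in (7, 11, 19, 23, 31, 43)))
```

For a prime such as 13, which is not of the form 4m − 1, the same check fails, as the hypothesis of the theorem suggests it should.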


This difference method can be extended to non-symmetric designs. For example
consider the construction of a league schedule for 2n teams as described in
Section 8.1. Apart from the game involving ∞, the first round games were


1 v (2n − 2), 2 v (2n − 3), ..., (n − 1) v n.


The pairs {1, 2n − 2}, {2, 2n − 3}, ..., {n − 1, n} have differences ±(2n − 3),
±(2n − 5), ..., ±1, i.e. every non-zero member of Z_{2n−1}.

Again, consider the STS of order 13 presented in Example 9.7. The blocks are
the translates of the two initial blocks {1, 2, 5} and {1, 3, 9}. These two blocks
have differences ±1, ±3, ±4 and ±2, ±6, ±8, i.e. all the non-zero members of
Z_13, each exactly once.


Example 9.16 


The blocks {1, 2, 9}, {1, 3, 17}, {1, 5, 14} have differences ±1, ±7, ±8, ±2, ±14,
±16, ±4, ±9, ±13 (mod 19), i.e. each non-zero element of Z_19 once. So the
translates will form a STS(19).
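
Developing several initial blocks at once works in the same way as for a single difference set. The following sketch builds the translates and confirms the Steiner triple system property directly; `develop` and `is_steiner_triple_system` are names chosen for this illustration.

```python
from itertools import combinations


def develop(base_blocks, v):
    """All translates B + i (mod v) of the base blocks, using the labels 1..v."""
    return [
        frozenset(((x + i - 1) % v) + 1 for x in blk)
        for blk in base_blocks
        for i in range(v)
    ]


def is_steiner_triple_system(blocks, v):
    """Every block is a triple and every pair of points lies in exactly one block."""
    seen = set()
    for blk in blocks:
        if len(blk) != 3:
            return False
        for pair in combinations(sorted(blk), 2):
            if pair in seen:
                return False
            seen.add(pair)
    return len(seen) == v * (v - 1) // 2


sts19 = develop([(1, 2, 9), (1, 3, 17), (1, 5, 14)], 19)
print(len(sts19), is_steiner_triple_system(sts19, 19))
```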


9.6 Hadamard Matrices and Codes 


A binary code is a collection of n-digit binary sequences, called codewords.
If codewords are transmitted then it is possible that errors will arise due to
interference, and so the received codewords may differ in some places from
those that were sent. The basic idea behind an error-correcting code is to choose
the codewords to be sufficiently different from each other so that even if some
errors in transmission occur, each received word is “closer” to the transmitted
codeword than to any other. We have here the concept of the distance between
two codewords, namely the number of places in which they differ. If all the
codewords are chosen so that each pair differ in at least 2t + 1 places, then,
even if t errors are made in the transmission of a codeword, the received binary
sequence will still be closer to the original than to any other, and hence can be
correctly decoded as the codeword nearest to it.


Definition 9.9 


A binary code of length n is a set C of n-digit binary sequences, called codewords.
The (Hamming) distance d(x, y) between any two codewords x, y is
the number of places in which they differ. If d(x, y) ≥ 2t + 1 for all codewords
x, y, x ≠ y, the code C is said to be a t-error-correcting code.


Example 9.17 


The four codewords 0000000, 1111111, 1010101, 0101010 differ from each other
in at least 3 places, and so form a 1-error-correcting code. For example, if
1010101 is sent, and due to interference 1110101 is received, 1110101 is closer
to 1010101 than to any other codeword and hence will be decoded correctly.
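
Nearest-codeword decoding, as used in Example 9.17, can be sketched as follows. The function names are our own, and ties (which cannot occur when at most t errors are made) are broken here simply by the order of the list.

```python
def hamming_distance(x, y):
    """Number of places in which two equal-length sequences differ."""
    return sum(a != b for a, b in zip(x, y))


def minimum_distance(code):
    """Smallest distance between two distinct codewords."""
    return min(hamming_distance(x, y) for i, x in enumerate(code) for y in code[i + 1:])


def decode(received, code):
    """Return the codeword nearest to the received sequence."""
    return min(code, key=lambda c: hamming_distance(received, c))


code = ["0000000", "1111111", "1010101", "0101010"]
print(minimum_distance(code))
print(decode("1110101", code))
```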

There are two conflicting aspects of a code. Given n, it would be desirable 
to have the minimum distance between any two codewords as big as possible 
(to enhance error correction) but it would also be desirable to have as many 
codewords as possible. But these properties conflict: you cannot have too many 
codewords which are all a large distance apart. So we have a fundamental 
problem: given n and k, how many binary sequences of length n can we find 
such that each pair of binary sequences differ in at least k places? 


We shall look at a particular case of this problem, namely when k is about n/2.

First we consider the case when n is odd.


Lemma 9.19 


Suppose that there are N binary sequences of length n = 2m − 1, any two of
which differ in at least m places. Then N ≤ n + 1.


Proof 


Consider the N sequences as the rows of an N × n (0, 1)-matrix. Let S denote
the sum of all the distances d(x, y) between the sequences:

\[ S = \sum_{x \neq y} d(x, y). \]

Here the sum is over all $\binom{N}{2}$ pairs of distinct sequences x, y.

We shall count S in two different ways. For each pair x, y, d(x, y) ≥ m; so
we obtain immediately that

\[ S \geq \binom{N}{2} m. \qquad (9.9) \]

Now consider the jth column of the matrix. If it contains a_j 0s and b_j 1s, then
a_j + b_j = N and, since each of the a_j 0s gives a difference with each of the b_j
1s, there is a contribution of a_j b_j to S from this column. So $S = \sum_{j=1}^{n} a_j b_j$.

Now it is easily checked (e.g. by calculus) that if x + y = N, then the maximum
possible value of xy is N²/4. So each a_j b_j is at most N²/4, and we have

\[ S \leq \frac{nN^2}{4}. \qquad (9.10) \]

From (9.9) and (9.10) we obtain

\[ \frac{N(N-1)}{2}\, m \leq S \leq \frac{nN^2}{4}, \]

whence

\[ (2m - n)N \leq 2m, \]

i.e. (since n = 2m − 1)

\[ N \leq n + 1. \]


Example 9.18 


How many codewords of length 11 can be found, such that each pair differ in 
at least 6 places? 


Solution 


By Lemma 9.19, we cannot hope to find more than 12 such codewords. But, 
further, we can find as many as 12 by using the Hadamard design of Example 
9.14. The 11 rows of the incidence matrix (9.8) each have 5 1s; but any two 
rows share only two 1s; so any two rows differ in 2 x (5-2) = 6 places. So if we 
take the 11 rows as codewords, along with the row of all 1s (which will differ 
from the other 11 in 6 places), we obtain 12 codewords as required. 
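
The count in this solution is easy to confirm by machine. The sketch below rebuilds the rows of (9.8) as translates of {1, 3, 4, 5, 9} and lists the distances that occur; the helper names are illustrative.

```python
def translate_rows(start, p):
    """Incidence rows of the translates start + i (mod p), columns labelled 1..p."""
    return [[1 if (col - i) % p in start else 0 for col in range(1, p + 1)] for i in range(p)]


def distance(x, y):
    """Number of places in which two rows differ."""
    return sum(a != b for a, b in zip(x, y))


rows = translate_rows({1, 3, 4, 5, 9}, 11)  # the incidence matrix (9.8)
codewords = rows + [[1] * 11]               # together with the row of all 1s

distances = {distance(x, y) for i, x in enumerate(codewords) for y in codewords[i + 1:]}
print(len(codewords), distances)
```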


Corollary 9.20 


Let C be a code of length n = 2m, containing N codewords, each pair of which
differ in at least m places. Then N ≤ 2n. Further, if a Hadamard matrix of
order n exists, then there exists such a code with 2n codewords.


Proof 


Consider the codewords of C which begin with 0. Omitting this 0 from them,
we obtain codewords of length 2m − 1 differing in at least m places. By Lemma
9.19 there can be at most 2m such codewords. Similarly, there can be at most
2m codewords of C beginning with 1; thus C has at most 4m = 2n codewords.

If n = 2, then C = {11, 10, 01, 00} satisfies the requirements. So suppose now
that there exists a Hadamard matrix H of order n > 2. Then n = 4u for some
u. Any two rows of H differ in exactly 2u = n/2 places, and agree in exactly n/2
places.

Let A denote the matrix obtained from H by replacing −1 by 0, and let Ā
denote the matrix obtained from A by interchanging 0s and 1s. Then any two
rows of A differ in at least n/2 places, as do any two rows of Ā, and a row of A
and a row of Ā differ in either n/2 or all n places. So the rows of A and Ā give
the required 2n codewords.


Example 9.19 

Take the Hadamard matrix H_5 constructed by the duplication method of
Section 9.4. It has 32 rows and columns. Let A denote the matrix obtained from
H_5 by replacing each −1 by 0, and let Ā be obtained from A by interchanging
0s and 1s. Then the rows of A and Ā form a code of 64 codewords of length
32, each pair differing in at least 16 places; so the code is 7-error-correcting. Such
a code was used in the 1972 Mariner 9 space probe to Mars, to send
photographs back to Earth. Each photograph consisted of lots of dots of
different shades of grey (64 shades needed 6-digit binary sequences to represent
them, since 2^6 = 64), and the sequence of coded shades was encoded using the
7-error-correcting code which we have just described. The resulting pictures
were remarkably good!
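
The code of Example 9.19 is small enough to build and examine directly. What follows is a sketch of the construction in Corollary 9.20 applied to H_5; the function names are our own and nothing here reflects how the spacecraft actually implemented encoding or decoding.

```python
def sylvester(m):
    """Doubling construction: H_m of order 2**m with entries +1 and -1."""
    H = [[1, 1], [1, -1]]
    for _ in range(m - 1):
        H = [row + row for row in H] + [row + [-x for x in row] for row in H]
    return H


def hadamard_code(H):
    """Rows of A (replace -1 by 0) together with the rows of its complement."""
    A = [[1 if x == 1 else 0 for x in row] for row in H]
    A_bar = [[1 - x for x in row] for row in A]
    return A + A_bar


def distance(x, y):
    """Number of places in which two codewords differ."""
    return sum(a != b for a, b in zip(x, y))


code = hadamard_code(sylvester(5))
min_dist = min(distance(x, y) for i, x in enumerate(code) for y in code[i + 1:])
t = (min_dist - 1) // 2  # number of errors that can be corrected
print(len(code), min_dist, t)
```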


More recent space probes have used much more sophisticated codes, as do
compact discs and other modern gadgets. We refer the reader to Hill [12] for a 
good introduction to coding theory. 

There is one final connection between codes and Hadamard designs with 
which we shall bring this chapter to a close. Before describing it, we need first 
to find a bound on the number of codewords in a t-error-correcting code. 


Theorem 9.21 


If C is a binary t-error-correcting code of length n, then

\[ |C| \left\{ \binom{n}{0} + \binom{n}{1} + \cdots + \binom{n}{t} \right\} \leq 2^n. \qquad (9.11) \]

Proof 


Any two codewords of C differ in at least 2t + 1 places. Thus any sequence
differing from a codeword x in at most t places will be corrected to x, and no
sequence is correctable to two different codewords.

Now, for each i, 0 ≤ i ≤ t, there are $\binom{n}{i}$ binary sequences of length n differing
from x in i places; so the number of sequences correctable to x is $\sum_{i=0}^{t} \binom{n}{i}$.
Thus, since there are |C| possible choices of x, there must be at least
$|C| \sum_{i=0}^{t} \binom{n}{i}$ binary sequences of length n. Since there are precisely
2^n binary sequences of length n, the result follows.
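
The bound (9.11) is simple to evaluate for particular n and t. The sketch below computes the size of a "ball" of radius t and the largest |C| the bound allows; the function names `sphere_size` and `hamming_bound` are our own labels.

```python
from math import comb


def sphere_size(n, t):
    """Number of binary sequences of length n within distance t of a fixed sequence."""
    return sum(comb(n, i) for i in range(t + 1))


def hamming_bound(n, t):
    """Largest number of codewords permitted by (9.11)."""
    return 2 ** n // sphere_size(n, t)


for n, t in [(7, 1), (23, 3), (32, 7)]:
    print(n, t, sphere_size(n, t), hamming_bound(n, t))
```

Equality in (9.11) requires `sphere_size(n, t)` to divide 2^n exactly, which is why it must be a power of 2.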


A code C is said to be perfect if there is equality in (9.11). For such a code C,
every binary sequence is correctable to a codeword of C, i.e. every sequence is
at distance at most t from a unique codeword. Note that for equality to hold
in (9.11), we must have $\binom{n}{0} + \cdots + \binom{n}{t}$ equal to a power of 2.

For a perfect 1-error-correcting code (t = 1), equality in (9.11) reads

\[ |C| \cdot (1 + n) = 2^n. \]

Consider the case n = 7, so that |C| = 16. We can construct such a code by
taking the seven rows of the matrix (9.7), the seven codewords obtained from
these by interchanging 0s and 1s, and the row of all 0s and the row of all 1s.
This gives 16 codewords as follows.


0101010   1010101
1001100   0110011
0011001   1100110
1110000   0001111
0100101   1011010
1000011   0111100
0010110   1101001
0000000   1111111


Another way of obtaining these codewords is to take four of them which are
linearly independent over Z_2, and to take all linear combinations of them over
Z_2. In other words, we choose four linearly independent codewords c_1, ..., c_4
and take all linear combinations λ_1c_1 + λ_2c_2 + λ_3c_3 + λ_4c_4 where each λ_i is 0
or 1. For example, if we take the first four codewords above,


c_1 = 0101010, c_2 = 1001100, c_3 = 0011001, c_4 = 1110000,


we then have

0100101 = c_2 + c_3 + c_4,
1010101 = c_2 + c_3,


etc. Thus C consists of all linear combinations (mod 2) of the rows of the matrix


0101010
1001100
0011001
1110000



which is called a generator matrix for C. C is called a linear code. Linear
codes have special advantages over other codes. They all include 0 as a
codeword, and the minimum distance between any two codewords is just the
minimum distance of codewords from 0, i.e. the minimum number of 1s in a
non-zero codeword (see Exercise 9.19). In the above example, this number is
clearly 3, so the code is 1-error-correcting. For each n of the form $2^m - 1$ there
is a perfect linear 1-error-correcting code of length n whose generator matrix
has k rows, where $k = n - m$. These Hamming codes are described in [12].
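The claims about this length-7 code can be checked by brute force. The sketch below takes the four codewords $c_1, \ldots, c_4$ quoted in the text as generators, forms all 16 linear combinations mod 2, and reports the minimum non-zero weight. The helper names `add_mod2` and `span` are illustrative and not part of the text.

```python
from itertools import product

# The four generating codewords c1..c4 quoted in the text.
GENERATORS = ["0101010", "1001100", "0011001", "1110000"]

def add_mod2(x: str, y: str) -> str:
    """Componentwise sum mod 2 of two binary strings of equal length."""
    return "".join("1" if a != b else "0" for a, b in zip(x, y))

def span(rows):
    """All linear combinations (mod 2) of the given binary strings."""
    words = set()
    for coeffs in product([0, 1], repeat=len(rows)):
        word = "0" * len(rows[0])
        for coeff, row in zip(coeffs, rows):
            if coeff:
                word = add_mod2(word, row)
        words.add(word)
    return words

code = span(GENERATORS)
min_weight = min(w.count("1") for w in code if "1" in w)
print(len(code), min_weight)
```

With 16 codewords of length 7 and minimum weight 3, the spheres of radius 1 contain 16 × 8 = 128 = $2^7$ sequences in total, which is the equality that makes the code perfect.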

For t > 1, perfect t-error-correcting codes are rare. But there is one jewel
corresponding to the identity

$$\binom{23}{0} + \binom{23}{1} + \binom{23}{2} + \binom{23}{3} = 2^{11} = 2^{23-12}.$$


Example 9.21 (The perfect Golay code $G_{23}$)


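Recall from Example 9.15(iv) that {2, 6, 7, 8, 10, 11} is a (11, 6, 3) difference
set (mod 11) whose translates form the design complementary to a Hadamard
(11, 5, 2) design. Let A denote the incidence matrix of this (11, 6, 3) design:

$$A = \begin{pmatrix}
0 & 1 & 0 & 0 & 0 & 1 & 1 & 1 & 0 & 1 & 1 \\
1 & 0 & 1 & 0 & 0 & 0 & 1 & 1 & 1 & 0 & 1 \\
1 & 1 & 0 & 1 & 0 & 0 & 0 & 1 & 1 & 1 & 0 \\
\vdots & & & & & \vdots & & & & & \vdots
\end{pmatrix}$$

Then define the matrix

$$B = \left(\begin{array}{c|c|c|c}
1 & & 0 & \\
\vdots & I_{11} & \vdots & A \\
1 & & 0 & \\
\hline
0 & 0 \cdots 0 & 1 & 1 \cdots 1
\end{array}\right).$$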


Then B is a 12 × 24 (0,1)-matrix. The code $G_{23}$ is then obtained by removing
the first column of B and taking all linear combinations of the rows of the
resulting 12 × 23 matrix. The reason for first including the first column of B
is that it makes the arithmetic of the argument simpler. It is clear that, since
the rows of A differ in 6 places, any two rows of B differ in at least 8 places.
Our aim is to show that any two linear combinations of rows of B differ in at
least 8 places, and so any two codewords of $G_{23}$ will differ in at least 7 places.
We have to show that, if x is a non-zero linear combination of rows of B, then
$w(x) \ge 8$, where w(x), the weight of x, is the number of 1s in x. We do this
by a sequence of observations. Let V denote the set of linear combinations of
rows of B.

(i) $w(x + y) = w(x) + w(y) - 2\,x \cdot y$ for all $x, y \in V$.

This follows since w(x) + w(y) counts twice any 1s in the same place in
x and y, but these 1s add to 0 (mod 2) and hence have to be subtracted
twice.

(ii) Every row of B has weight 8 or 12.

(iii) If x and y are any two rows of B, then $x \cdot y$ is even.

This follows since any two rows of A have 1s together in exactly 3 places.

(iv) If x and y are any two rows of B, w(x + y) is a multiple of 4.

This follows from (i), (ii) and (iii).

(v) If $x \in V$ and y is a row of B, then $x \cdot y$ is even.

This is proved by induction. For the induction step consider x = z + r where
r is a row of B and z is the sum of k rows of B. Then $x \cdot y = (z + r) \cdot y \equiv
z \cdot y + r \cdot y \pmod 2$, which is even since $z \cdot y$ is even by induction hypothesis
and $r \cdot y$ is even by (iii).

(vi) If $x \in V$ then w(x) is a multiple of 4.

Again, this follows by induction. For the induction step take x = z + r as
in (v). Then $w(x) = w(z) + w(r) - 2\,z \cdot r$. But w(z) is a multiple of 4 by
hypothesis, w(r) is a multiple of 4 by (ii), and $z \cdot r$ is even by (v).

Since each $x \in V$ has weight a multiple of 4, it follows that, to prove $w(x) \ge 8$,
we need only show that w(x) = 4 is impossible. This is achieved by considering
the left half and the right half of each $x \in V$. Denote by $w_L(x)$ and $w_R(x)$ the
weights of the left and right halves of x respectively.


(vii) $w_L(x)$ is even for all $x \in V$.

This follows since if x involves an even number of the first 11 rows of B
then the 1s in the first column sum to 0 (mod 2), leaving an even number
of 1s from $I_{11}$, while if x involves an odd number of these rows, x has 1
in the first position and an odd number of 1s from $I_{11}$.

(viii) $w_L(x) = 0$, $w_R(x) = 4$ is impossible.

For if $w_L(x) = 0$ then x is either 0 (in which case w(x) = 0) or the last
row of B (in which case $w_R(x) = 12$).

(ix) $w_L(x) = w_R(x) = 2$ is impossible.

For if $w_L(x) = 2$ then x must be the sum of one or two of the first 11 rows
of B, possibly together with the last row of B. But the weight of the sum
of one or two rows of A is 6 > 2, and if the last row of B is added the
resulting x has $w_R(x) = 6 > 2$.

(x) $w_L(x) = 4$, $w_R(x) = 0$ is impossible.

Here x must be the sum of 3 or 4 of the first 11 rows of B. If x is the
sum of 3 rows, let r be any other of the first 11 rows. Then, since $w_R(x) = 0$,
$w_R(x + r) = 6$, and $w_L(x + r) = 4$, giving w(x + r) = 10, contrary to (vi).
If x is the sum of 4 rows of B, let t be one of them. Then x = z + t where
z is the sum of 3 rows of B. Then $w_L(z) = 4$, and $w_R(z) = w_R(x + t) =
w_R(t) = 6$ since $w_R(x) = 0$. So w(z) = 10, again contradicting (vi). So
$w_L(x) = 4$, $w_R(x) = 0$ is impossible.


Thus the minimum weight of non-zero sums $x \in V$ is indeed 8, and the minimum
weight of a non-zero codeword of $G_{23}$ is indeed 7. Thus $G_{23}$ is a 3-error-correcting
code as required.

The code $G_{23}$ has remarkable connections with Steiner systems; the reader is
referred to [2] for further details.
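The weight argument can also be confirmed by exhaustive computation, since V has only $2^{12}$ elements. The sketch below rebuilds B from the difference set {2, 6, 7, 8, 10, 11}, assuming the block layout (column of 1s, $I_{11}$, column of 0s, A, with a last row of twelve 0s followed by twelve 1s) described in the example. Rows are stored as 24-bit integers; all names are illustrative.

```python
from math import comb

D = {2, 6, 7, 8, 10, 0}  # the difference set mod 11, with 11 written as 0

def row_bits(i: int) -> list:
    """Row i (0 <= i <= 10) of B: 1 | row i of I_11 | 0 | translate D + i."""
    left = [1] + [1 if j == i else 0 for j in range(11)]
    right = [0] + [1 if (j - i) % 11 in D else 0 for j in range(1, 12)]
    return left + right

def to_int(bits: list) -> int:
    """Pack a list of 0/1 entries into an integer, first entry most significant."""
    return int("".join(map(str, bits)), 2)

rows = [row_bits(i) for i in range(11)] + [[0] * 12 + [1] * 12]
B = [to_int(r) for r in rows]

def span(generators):
    """All sums (mod 2, i.e. XOR) of subsets of the generators."""
    words = {0}
    for g in generators:
        words |= {w ^ g for w in words}
    return words

V = span(B)
weights = {bin(x).count("1") for x in V if x}

# Removing the first column of B clears the most significant of the 24 bits.
G23 = {x & (2**23 - 1) for x in V}
min_weight_g23 = min(bin(x).count("1") for x in G23 if x)
sphere = sum(comb(23, i) for i in range(4))  # sequences within distance 3

print(len(V), sorted(weights), min_weight_g23, len(G23) * sphere == 2**23)
```

The final comparison tests the equality $|G_{23}| \sum_{i=0}^{3} \binom{23}{i} = 2^{23}$, which is what perfection means here.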


Exercises 


Exercise 9.1 
Show that no (a) (17,9, 2), (b) (21,6, 1) design exists. 


Exercise 9.2 


How would you construct a (13,9, 6) design? 


Exercise 9.3


Use (9.1) to show that (a) $vr(k-1)\lambda = r^2(k-1)^2 + r(k-1)\lambda$,
(b) $(k-1)\lambda = (k-1)r - (v-k)\lambda$.


Exercise 9.4


Show that, in a symmetric $(v, k, \lambda)$ design, $k - 1 < \sqrt{\lambda v} < k$.


Exercise 9.5


Deduce from the proof of Theorem 9.3 that if a $(v, k, \lambda)$ symmetric design
exists then its incidence matrix A satisfies $|A|^2 = k^2 (k - \lambda)^{v-1}$. Hence
show that if a symmetric design exists with v even then $k - \lambda$ must be
a perfect square. Hence show that no (34, 12, 4) or (46, 10, 2) design can
exist.


Exercise 9.6


Write down the parameters $(b, v, r, k, \lambda)$ for the complementary design
of (a) an affine plane of order n, (b) a $(4m-1, 2m-1, m-1)$ Hadamard
design.


Exercise 9.8 


How would you construct a Hadamard matrix of order 24? 


Exercise 9.9 


Show that if $D = \{d_1, \ldots, d_k\}$ is a $(v, k, \lambda)$ difference set in $Z_v$ then
$-D = \{-d_1, \ldots, -d_k\}$ and all translates of D are also $(v, k, \lambda)$ difference
sets.


Exercise 9.10 
Verify that {1,2,3,5,6,9,11} yields a (15, 7,3) design. 


Exercise 9.11 


Verify that {1,2,8, 20, 24,45, 48,50} is a difference set (mod 57) which 
yields a FPP of order 7. 


Exercise 9.12 
Verify that {1,8, 11, 12, 24} and {1,6, 15, 21, 23} yield a (41,5, 1) design. 


Exercise 9.13


Verify that the translates of {1, 2, 13}, {1, 4, 9}, {1, 3, 10}, {1, 5, 11} form
a STS(25).


Exercise 9.14


Verify that the sets {1, 2, 7}, {1, 3, 14}, {1, 5, 13}, {1, 4, 11} have as differences
all non-zero elements of $Z_{27}$ except 9 and 18. Deduce that their
translates, along with nine translates of {1, 10, 19}, form a STS(27).


Exercise 9.15 


Take a 6 × 6 latin square L, and the 6 × 6 "natural" array N with 1, ..., 36
in natural order (cf. the 4 × 4 N at the beginning of Section 7.4). Call
i, j associates if they are in the same row or column of N or if they are
in positions in N where the corresponding entries in L are equal. Let
$B_i = \{ j : j \ne i \text{ and } i, j \text{ are associates} \}$. Show that $B_1, \ldots, B_{36}$ are the
blocks of a symmetric (36, 15, 6) design.


Exercise 9.16 


(a) Show that if A is a square (0,1) matrix and B is obtained from A
by replacing each 0 by −1 then B = 2A − J.

(b) Show that if A is the incidence matrix of a symmetric $(v, k, \lambda)$ design
then B = 2A − J is a Hadamard matrix if and only if $v = 4(k - \lambda)$.

(c) Deduce that Exercise 9.15 enables a Hadamard matrix of order 36
to be constructed.


Exercise 9.17 


Can you construct a Hadamard matrix of order 4m for all m up to 12? 


Exercise 9.18 


By following the proof of Lemma 9.19, show that if a code has N codewords
of length n, each differing in at least d places, where $d > \frac{n}{2}$, then
$N \le \frac{2d}{2d - n}$. (This is called Plotkin's bound.)


Exercise 9.19 


Show that in a linear code the minimum distance between two non-equal 
codewords is the minimum weight of a non-zero codeword. 


Exercise 9.20


Show that if a perfect 1-error-correcting code of length n exists then a
STS(n) exists.


Exercise 9.21


(A non-matrix proof of Fisher's inequality.) Choose any block B, and
let $x_i$ denote the number of blocks intersecting B in exactly i elements.
Show first that (a) $\sum_i \binom{i}{2} x_i = \binom{k}{2}(\lambda - 1)$, (b) $\sum_i i x_i = k(r - 1)$.

Let m denote the average value of $|B \cap C|$ taken over all blocks $C \ne B$.
Thus (c) $m(b - 1) = \sum_i i x_i$.

Show that $\sum_i (i - m)^2 x_i \ge 0$ yields

(d) $(b-1)k(k-1)(\lambda-1) + (b-1)k(r-1) \ge k^2 (r-1)^2$.

Now use (9.1) and Exercise 9.3 to deduce that $(r-k)(r-\lambda)(v-k) \ge 0$.
This implies $r \ge k$, whence $b \ge v$.

(By now you will be glad of the matrix proof!)



Appendix 


Arithmetic modulo n 


Let $n \ge 2$ be an integer. If a, b are integers, we say that a is congruent to b
modulo n (and we write $a \equiv b \pmod n$) whenever a and b differ by a multiple
of n, i.e. whenever n divides a − b. Thus, for example,

$8 \equiv 3 \pmod 5$, $2 \equiv 10 \pmod 4$, $20001 \equiv -99 \pmod{100}$.

Let $Z_n$ denote the set {0, 1, ..., n − 1} in which addition and multiplication
are carried out mod n. (Sometimes 0 is replaced by n.) For example, in $Z_9$,
7 × 8 = 2 since $56 \equiv 2 \pmod 9$.
Here are the addition and multiplication tables for $Z_5$.

 + | 0 1 2 3 4        × | 0 1 2 3 4
---+----------       ---+----------
 0 | 0 1 2 3 4        0 | 0 0 0 0 0
 1 | 1 2 3 4 0        1 | 0 1 2 3 4
 2 | 2 3 4 0 1        2 | 0 2 4 1 3
 3 | 3 4 0 1 2        3 | 0 3 1 4 2
 4 | 4 0 1 2 3        4 | 0 4 3 2 1


If p is prime, $Z_p$ has the properties that characterise a field. Note that if $p \mid ab$
then $p \mid a$ or $p \mid b$, i.e. if $ab \equiv 0 \pmod p$, then $a \equiv 0$ or $b \equiv 0$. Thus the
multiplication table for $Z_p - \{0\}$ has no 0s. (This is unlike the situation in $Z_6$, for
example, where 2 × 3 = 0.) Next observe that if $p \nmid t$ (i.e. p does not divide
t) then $t, 2t, \ldots, (p-1)t$ are all distinct (mod p): for if $ta \equiv tb \pmod p$ then
$p \mid t(a - b)$ whence $p \mid (a - b)$, i.e. $a \equiv b \pmod p$. Thus $t, 2t, \ldots, (p-1)t$ are just
the same as $1, 2, \ldots, p-1$, but in a different order. The first consequence of
this is that there must be some s, $1 \le s \le p-1$, such that $st \equiv 1 \pmod p$. Thus
each nonzero member t of $Z_p$ has an inverse which we can denote by $t^{-1}$. For
example, since 2 × 3 = 1 in $Z_5$, the inverse of 2 is 3, i.e. $2^{-1} = 3$ in $Z_5$.
The second consequence arises from the congruence

$t \cdot 2t \cdots (p-1)t \equiv 1 \cdot 2 \cdots (p-1) \pmod p$,

i.e.

$t^{p-1} (p-1)! \equiv (p-1)! \pmod p$.

Multiplying by the inverse of (p − 1)! gives Fermat's theorem

$t^{p-1} \equiv 1 \pmod p$ whenever $p \nmid t$.


For example, 


There is a further consequence of the existence of inverses. First note that
the only two numbers (mod p) which coincide with their inverse are 1 and
−1. (For $x^2 \equiv 1 \Leftrightarrow (x-1)(x+1) \equiv 0 \Leftrightarrow x \equiv 1$ or $-1 \pmod p$.) Consider
$(p-1)! = 1 \cdot 2 \cdot 3 \cdots (p-2) \cdot (p-1)$. The numbers $2, \ldots, p-2$ must consist
of $\frac{1}{2}(p-3)$ pairs of numbers and their inverses; and so the product of all these
numbers must be 1. So $(p-1)! \equiv 1 \cdot 1 \cdot (p-1) \equiv -1 \pmod p$. Thus we obtain
Wilson's theorem:

$(p-1)! \equiv -1 \pmod p$.
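The three facts derived so far (existence of inverses, Fermat's theorem and Wilson's theorem) can be spot-checked for small primes. The following is a minimal sketch; the function names are illustrative, and the inverse is found by the same direct search over s = 1, ..., p − 1 that the argument above implicitly uses.

```python
from math import factorial

def inverse_mod(t: int, p: int) -> int:
    """Return s with 1 <= s <= p - 1 and s * t = 1 (mod p), by direct search."""
    for s in range(1, p):
        if (s * t) % p == 1:
            return s
    raise ValueError("t has no inverse mod p")

def multiples_are_a_rearrangement(t: int, p: int) -> bool:
    """Check that t, 2t, ..., (p-1)t reduce mod p to 1, ..., p-1 in some order."""
    return sorted((k * t) % p for k in range(1, p)) == list(range(1, p))

def fermat_holds(p: int) -> bool:
    """Check t**(p-1) = 1 (mod p) for every t = 1, ..., p - 1."""
    return all(pow(t, p - 1, p) == 1 for t in range(1, p))

def wilson_holds(p: int) -> bool:
    """Check (p-1)! = -1 (mod p)."""
    return factorial(p - 1) % p == p - 1

print(inverse_mod(2, 5))                   # inverse of 2 in Z_5
print(fermat_holds(11), wilson_holds(13))
print(wilson_holds(9))                     # 9 is not prime
```

The last line illustrates that the Wilson congruence fails for the composite modulus 9.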


Squares and non-squares in $Z_p$

Let p be an odd prime: p = 2k + 1. Then the numbers $1^2, 2^2, \ldots, k^2$ are all
distinct (mod p); for if $x^2 \equiv y^2$ then $(x - y)(x + y) \equiv 0 \pmod p$ so that $x \equiv y$
or $x \equiv -y \pmod p$. But $x \equiv -y$ is impossible (since $2 \le x + y \le 2k < p$), so x = y.

Since $(p - x)^2 \equiv (-x)^2 \equiv x^2 \pmod p$, it follows that, of the p − 1 = 2k nonzero
numbers (mod p), exactly half are squares and half are non-squares. Note that:
(A1) if x and y are squares in $Z_p$ then so is xy (for if $x = u^2$ and $y = v^2$ then
$xy = (uv)^2$);
(A2) if x is a square and y is a non-square in $Z_p$ then xy is a non-square (for
if $x = u^2$ and $xy = w^2$ then $y = (u^{-1}w)^2$ would be a square).
Further, note that if −1 is a square then $-1 \equiv x^2 \pmod p$ for some $x \in Z_p$.
Raising both sides to the power $\frac{1}{2}(p-1) = k$ gives $(-1)^k \equiv x^{p-1}$. But $x^{p-1} \equiv 1$
by Fermat's theorem, so $(-1)^k \equiv 1 \pmod p$. Thus k must be even and so $p \equiv
1 \pmod 4$. Thus:
(A3) if $p \equiv 3 \pmod 4$, −1 is a non-square.
It then follows from (A2) that:
(A4) if $p \equiv 3 \pmod 4$ then x is a square $\Leftrightarrow$ −x is a non-square.
Example. Take p = 11. The squares (mod 11) are 1, 4, 9, $16 \equiv 5$ and $25 \equiv 3$.
The negatives of 1, 3, 4, 5, 9 are 10, 8, 7, 6, 2, i.e. the non-squares.


Now consider $p \equiv 1 \pmod 4$; here we show that −1 is a square (mod p).
Note that, if p = 4k + 1,

$(p-1)! = 1 \cdot 2 \cdots 2k \cdot (2k+1) \cdots 4k$
$\equiv 1 \cdot 2 \cdots 2k \cdot (-2k) \cdots (-1) \pmod p$
$= (-1)^{2k} (2k)! (2k)! = ((2k)!)^2$,

so that (p − 1)! is a square. But, by Wilson's theorem, $(p-1)! \equiv -1 \pmod p$,
so −1 is a square. Thus we have:


(A5) if $p \equiv 1 \pmod 4$ then −1 is a square (mod p), and x is a square $\Leftrightarrow$ −x is a
square.

Example. Take p = 13. The squares (mod 13) are 1, 4, 9, $16 \equiv 3$, $25 \equiv 12 \equiv
-1$, $36 \equiv 10$, and their negatives are 12, 9, 4, 10, 1, 3, i.e. the squares.
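The two worked examples, and the criterion for −1 to be a square, can be reproduced with a few lines of code. This is a sketch only; `squares_mod` and `negatives` are illustrative names, and the squares are obtained exactly as in the text, by squaring 1, ..., k where p = 2k + 1.

```python
def squares_mod(p: int) -> list:
    """Non-zero squares mod an odd prime p, from 1^2, ..., k^2 with p = 2k + 1."""
    k = (p - 1) // 2
    return sorted({(x * x) % p for x in range(1, k + 1)})

def negatives(values, p: int) -> list:
    """The residues -v (mod p) for each v, sorted."""
    return sorted((-v) % p for v in values)

for p in (11, 13):
    sq = squares_mod(p)
    print(p, sq, negatives(sq, p))

# -1 is p - 1 (mod p); compare "is a square" with "p = 1 (mod 4)".
for p in (3, 5, 7, 11, 13, 17, 19, 23, 29):
    print(p, (p - 1) in squares_mod(p), p % 4 == 1)
```

For p = 11 the negatives of the squares are exactly the non-squares, as in (A4), while for p = 13 negation maps the set of squares to itself, as in (A5).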
Further number theoretic results can be found in the recent book by Jones 
and Jones [13]. 

Solutions


Chapter 1 


1.1 9! 
1.2 (a) (72) (2), () (3) — (6) — (72), © Does Cr) (e2,)- 
1.3 Yes, since 52! > 10°. 
1.4 : (9) ($)/(@) = 0.1765... 
b) (2)(8)/(@) = 0.00097... 

c) (8) (2)/(2) = 0.00002... 
1.5 1 in (a) 6724520, (b) 43949268 
1.6 1 in 14(%*) = 3895 584; so about 4 times as likely. 

1.7 (‘°) /2°. 

1.9 Add (i) and (ii) of Theorem 1.7.
1.10 Equate coefficients of x” on both sides. Then put n = 5 +1. 


1.11 (b) In resulting sum, terms cancel in pairs. (c) (";1) = coefficient of «* in 
(1+ 2)"(Sopeo(-1)"2") = De gene (*)(-D? = (-1)* Dh (-1)"(")- 
1.12 LHS =n}, (7) (G73) =n(™2"7) by Exercise 1.10. 


1.13 (a) (4871) = ('8). (6) (1) on putting z = 1+, etc. (c) Put c = 3+a,y = 
-1+b,z=1l+c,w=—2+d to get ('f). 


1.14 Solve a1 +...+ 24 +25 =6 with each 2; >0: (?). 


1.15 Consider a choice of 6 from 49 as a binary sequence of length 49 with exactly
six 0s. Want no two 0s adjacent. By Example 1.17, number is $\binom{44}{6} > \frac{1}{2}\binom{49}{6}$.

1.16 (a) 10 × 9 × 8 × 7, (b) $\binom{10}{4}$, (c) $10^4$, (d) $\binom{13}{4}$.

1.17 $\sum_k k\binom{n}{k} = n \sum_k \binom{n-1}{k-1} = n 2^{n-1}$ by Theorem 1.7(i). Average size is
$n 2^{n-1} / 2^n = \frac{1}{2}n$.
18 (a) Summing gives 17+ ...+n? = OR +(*P) = an(n + 1)(2n +1). 
(b) Summing gives P+. ee Sy eee a eS ee) 


by Theorem 1.8. 
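1.19 Let $S = \binom{n}{0} + \binom{n}{3} + \cdots$. Then de Moivre gives $e^{in\pi/3} = (1 + \omega)^n =
S + \omega\{\binom{n}{1} + \binom{n}{4} + \cdots\} + \omega^2\{\binom{n}{2} + \binom{n}{5} + \cdots\}$. Taking real parts gives
$\cos\frac{n\pi}{3} = S - \frac{1}{2}\{\binom{n}{1} + \binom{n}{2} + \binom{n}{4} + \binom{n}{5} + \cdots\} = S - \frac{1}{2}(2^n - S)$.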
Chapter 2 


1 (a) ag = 145 +...+ phe t+ pers = 145 4+...4 sdy = 2(1- d), 
(b) 39 — 29-1, 
(c) (2n — 1)3"7?, 


(d) Gn = 4@n_) — 3@n_2 has solution a, = A+ B3”. For particular solution 
try a, = K2". Get K = —4. So try a, = A+ B3" — 2"+?. Get 
= gntl _ gnt2. 


2.2 bn = Lat+l- 


2.3 Let c, = number ending in 1 = number ending in 2. Then dy, = d,_1 +2€n. 
But cn = dn-1 — Cn-1, SO get dyn = 2dn-) + dn—2. Since d, = 3 and 
dy = 7,dy = (1+ V2)"4) + 21 - V2). 


2.4 (a) f(z) on, +(dag+1)z3+--- = baf(z)ta(lt+ct+z7+---), 
sO f(z)Q-4$2) = ie. f(z) = = acaacks - erage oF) whence 


oft 


On = 2(1— x). 
(b) f(x) = (62 —1)( gs — z=) whence a, = 6(3"-1 — 27-1) — (3 - 2"). 
2.5 fn = fn—-1 + fn—2) 80 fn = Fy-1 (n> 2). 
2.6 Gy, = 3(27-1 — 27-2 4... 4 (-1)"-72) = 6(-1)"(1 —-2 +--+ + (—2)"-?) = 
2” + 2(-1)". 
2.7 Ly = 2428)" + (15x8)e 
10 f(t) = Fete? + (A+ s+ (t+ Py)ct +--+) = Rat Fox? + 
a? (Fyn t+ +++) +¢(For? +---) = 24 2a? + x? f(x) + 2( f(x) — 2), whence 
f(z)(’-2—2?) =a2+2?. Thus f(z) = +a -1= Aer - 5z)-1 
whence F, = Jg(a"*? + 82+). 
2.11 (a) By induction. (b) $F_n F_{n+2} - F_{n+1}^2 = \det M^{n+2} = (\det M)^{n+2} = (-1)^{n+2} =
(-1)^n$. (c) Write all three matrices as in (a) and then equate top left entries.
2.12 Induction step: $F_1 + \cdots + F_n + F_{n+1} = (F_{n+2} - 2) + F_{n+1} = F_{n+3} - 2$.
2.13 (a) $F_{2n} - 1$, (b) $F_{2n+1} - 1$, (c) $(-1)^n F_{n-1}$.


2.14 $a_n$ = no. in which 1 stays put + no. in which 1 and 2 change places
$= a_{n-1} + a_{n-2} + 1$. So $a_n = F_n - 1$.

.15 (a) There are gn, such subsets of {1,...,7} not containing n, and gn—2 
containing n; 80 gn = 9n~1 + 9n—2- (b) Take p= k,g=n—k. (c) Fry = 
Sie 

2.17 Label points 1, ..., 2n. If 1 is joined to 2r + 2, circle is divided into two
parts, one with 2r vertices 2, ..., 2r + 1 and the other with 2(n − r − 1)
vertices. So $a_n = \sum_r a_r a_{n-r-1}$. Check that $a_1 = 1$, $a_2 = 2$.


2.18 Use (2.14).
2.19 Induction step: $d_{k+1} = k d_k + k d_{k-1} > k d_k > k \cdot (k-1)! = k!$.


2.20 Number of comparisons is $\le 1 + 2 + \cdots + (n-1) = \frac{1}{2}n(n-1)$: same as
bubblesort.

.21 Eliminate y, from two given equations. Auxiliary equation has roots 1 and 
#,80 fn = A+ (#)"B, soc, 9 A. But m = A+Band A+ $B=2, = 
329 + tyo- Eliminate B to get A = S40 — Xo. 


22 dn. 


Chapter 3 


3.1 Use Corollary 3.2. 
3.2 p = 3n + 3, 2q = 4n + 2 + 2n + 2 = 6n + 4, so q = p − 1. Apply Theorem
3.4(iii).


3.3 Each component must have $\ge 1 + \frac{1}{2}(p-1) > \frac{1}{2}p$ vertices, so there can be
only one component.


3.4 (a) 4, (b) 40. 2 
3.5 g—(p-—1). 
3.6 (a) Spanning tree has 4 edges, so one of the x; must have 2 edges from it. 


(b) The x; joined to both a and b can be chosen in 3 ways; there are then 
2 choices of edge from each of the other x,: so 3 x 2? = 12. 


(c) $100 \times 2^{99}$.
3.7 (a) Choose in order AE, DC, AC, AB. (b) Choose AE, AC, DC, AB. 
3.8 Using Prim, choose HM, HEK,HA,GEK. 
3.9 11 flights required. $4 + 4\sqrt{2} + 3\sqrt{5}$.
3.10 (a) Suppose planar; p = 6, q = 9 gives r = 5. Bipartite, so get $18 = 2q \ge
4r = 20$, contradiction.


(b) Yes - draw 2 edges “outside”.
(c) No. If so, p = 8, q = 19, r = 13, so $38 = 2q \ge 3r = 39$.
(d) No. If so, p = 11, q = 20, r = 11, so $40 = 2q \ge 4r = 44$.
3.11 6. 
3.12 (a) (i) No. (ii) Yes (draw edges y,2,,, “outside” ). 


(b) @n = no. using tpynt+ no. not using TAYn = An—1 + 2an—-2. SO On = 
(ant? — (-1)"). 


3.13 (a) p − q + r = 2 and $2q \ge 3r$, so $2q \ge 3(2 + q - p)$, i.e. $q \le 3p - 6$. For
$K_5$, 3p − 6 = 9 < 10 = q.


(b) p − q + r = 2 and $2q \ge gr$, so $2q \ge g(2 - p + q)$, i.e. $(g-2)q \le g(p-2)$.
(c) Use g = 4 for $K_{3,3}$ and g = 5 for Petersen graph.


3.14 Follow Example 3.14. 2q = 3p, r = s + h, 2q = 4s + 6h, p − q + r = 2 yield
s = 6.


3.15 The lines of cut and circular arcs form a graph with 2n vertices of degree
3 on outside and $\binom{n}{2}$ vertices of degree 4 inside. So $p = 2n + \binom{n}{2}$, $2q =
6n + 4\binom{n}{2}$, so $r = n + \binom{n}{2} + 2$, including infinite region.


3.16 If $g_n$ denotes the number containing edge $x_0 x_n$, then $h_n = 2h_{n-1} + g_{n-1}$
and $g_n = h_n - h_{n-1}$; thus $h_n = 3h_{n-1} - h_{n-2}$.


Chapter 4 


4.1 (a) B and W must alternate. (b}) m = |B| = |W| = 7 if hamiltonian. 
4.2 (a) (i) and (ii) are hamiltonian. (iii) is not by Exercise 4.1(a). 
(b) None. 
(c) (iii). 
4.3 All hamiltonian; only octahedron is eulerian. 
4.4 Only (a) is planar. 


4.5 00000 - 01000 - 11000 - 10000 - 10100 - 11100 - 01100 - 00100 - 00110 -
01110 - 11110 - 10110 - 10010 - 11010 - 01010 - 00010 - 00011 - 01011 -
11011 - 10011 - 10111 - 11111 - 01111 - 00111 - 00101 - 01101 - 11101 -
10101 - 10001 - 11001 - 01001 - 00001 - 00000.


4.6 Let $A = \{j : v_1 \text{ and } v_j \text{ are adjacent}\}$ and $B = \{j : v_{j-1} \text{ and } v_p \text{ are
adjacent}\}$. Then $A \subseteq \{2, \ldots, p-1\}$, $|A| \ge \frac{p}{2}$, $B \subseteq \{3, \ldots, p\}$, $|B| \ge \frac{p}{2}$; so
$A \cap B \ne \emptyset$.


4.7 (a) Imitate proof of Dirac. $|A| = \deg(v_1)$, $|B| = \deg(v_p)$. Since $\deg(v_1) +
\deg(v_p) \ge p$, get $A \cap B \ne \emptyset$ again.
(b) If there are non-adjacent vertices u, w such that $\deg(u) + \deg(w) \le p - 1$,
then G has at most $\binom{p-2}{2} + (p-1) = \frac{1}{2}(p-1)(p-2) + 1$ edges.
(c) Take a vertex v in $K_p$ and remove all but one edge from v.


4.8 Remove A: (2 + 4) + 13 = 19. Remove B: (6 + 6) + 9 = 21. Upper bound:
the spanning tree AE, EC, CD, AB gives upper bound 23 (AECDBA).


4.9 Exact value is 37. 


4.11 (i) Take vertices 0,1,2, and all possible directed edges. Eulerian circuit 
gives 001122021 in cyclic order. 


(ii) Take vertices 00,01,... ,22, and draw directed edge from ij to jl. One 
solution is 000101112122210220120200211. 


4.12 $K_7$, with a loop at each vertex, is eulerian. With vertices 0, ..., n, $K_{n+1}$
with loops is eulerian $\Leftrightarrow$ n even. So arrangement is possible $\Leftrightarrow$ n even.
For n = 6, one eulerian circuit is 00112233445566024613503625140, giving
cyclic arrangement of dominoes (0,0), (0,1), (1,1), (1,2), ... etc.


4.13 $K_n$ eulerian $\Leftrightarrow$ n odd. Required arrangement corresponds to eulerian circuit.
For n even, duplicate each edge to get an arrangement with every pair
adjacent twice.


Chapter 5 


5.1 cod 
5.2 Dues 
5.3 (13) a6I- . 
5.4 shh; x 2. 
n 


5.5 (a) One of the parts must have two elements: choose it in $\binom{n}{2}$ ways.


(b) There is either one set with 3 elements or two sets with two. In second 
case choose 4 elements and then partition into two pairs. Alternatively 
use (5.2) and iteration. 


5.6 Induction step: $S(k+1, 3) = S(k, 2) + 3S(k, 3) > 3S(k, 3) \ge 3 \cdot 3^{k-2} =
3^{(k+1)-2}$.


5.7 n is in a part with $\ell$ others, $0 \le \ell \le n - k$; so $S(n, k) = \sum_{\ell} \binom{n-1}{\ell} S(n -
1 - \ell, k - 1)$. Put $m = n - 1 - \ell$.


$B(n) = 1 + \sum_{k=2}^{n} S(n, k) = 1 + \sum_{k=2}^{n} \sum_{m=k-1}^{n-1} \binom{n-1}{m} S(m, k-1)$
$= 1 + \sum_{m=1}^{n-1} \binom{n-1}{m} \sum_{k=2}^{m+1} S(m, k-1)$
$= 1 + \sum_{m=1}^{n-1} \binom{n-1}{m} B(m)$.


5.8 Use Theorem 5.6. B(10) = 115975. 


5.9 BE+1) = Dro (m)B(m) = $Lmeo (%) Djeo or 
= Lye, t narra Pant 
= 2 Ljzo fl (+3) = pe eae 
5.10 n either forms a 1-cycle on its own or can be slotted into a cycle in one of
the s(n − 1, k) permutations of 1, ..., n − 1 (think of n inserted after one
of the other n − 1 elements). s(5, 2) = 50, s(6, 2) = 274.
5.11 x = 3,3,2. x =4,5,4. 
5.12 Each colour can be used at most a(G) times. 
5.13 (a) Vertices 1,... ,8 get colours 1, 2,3, 4,3, 1, 2,3. 
(b) Vertices 8,... ,1 get colours 1,1, 2,3, 4, 2, 5,1. 
5.14 (a) Vertices 8,1,3,6,7,2,5,4 get colours 1,1,2,2,1,3,4, 5. 
(b) Vertices 4,5, 2,7,6,3,1,8 get colours 1, 2,3, 3,4, 2,4, 1. 


5.15 Colour the vertices with colours c1,... ,c4. Then order vertices so that all 
coloured c, come first, then those coloured cz, etc. 


5.16 All are class 1. 


5.17 Colour edges of hamiltonian cycle using 2 colours. Remaining edges get a 
third colour. 


5.18 (a) Use Theorem 3.1. (b) Any matching can have at most k edges so
$\chi'(G) \ge \frac{1}{k}(k + \frac{1}{2})r > r$.
5.19 (b) Induction step. Remove a pendant vertex v: T − {v} can be coloured
in $\lambda(\lambda - 1)^{p-2}$ ways, and then v can be coloured in $(\lambda - 1)$ ways.


(c) In a colouring of G', x and y may get same colour, so subtract $f_\lambda(G'')$.
For deduction, use induction on the number of edges.

(d) Recurrence $a_n = \lambda(\lambda - 1)^{n-1} - a_{n-1}$ gives $a_n - (\lambda - 2)a_{n-1} - (\lambda -
1)a_{n-2} = 0$. Auxiliary equation is $(x + 1)(x - \lambda + 1) = 0$.

Chapter 6

6.1 $|A \cup B \cup C \cup D| = |A| + |B| + |C| + |D| - |A \cap B| - |A \cap C| - |A \cap D|
- |B \cap C| - |B \cap D| - |C \cap D| + |A \cap B \cap C|
+ |A \cap B \cap D| + |A \cap C \cap D|
+ |B \cap C \cap D| - |A \cap B \cap C \cap D|$.

6.2 100 = 67 + s − 44 gives s = 77.

6.4 1000 — 142 — 90 — 76+ 12+ 10+6-—0 = 720. 

6.5 (a) $|S_1| \ge 75$, $|S_1 \cap S_2| \ge 150 - 100 = 50$, $|S_1 \cap S_2 \cap S_3| \ge 75 + 50 - 100 = 25$.
(b) $|S_1| = m(n-1)$, $|S_1 \cap S_2| \ge 2m(n-1) - mn = m(n-2)$, etc.

6.6 Imitate Example 6.7: (197) — (2) — (3) - (2) + (2) + @) + (2) = 231. 


6.7 Take $P_1$: pattern 12 occurs, ..., $P_4$: pattern 78 occurs. N(i) = 7!, N(i, j) =
6!, etc. Obtain $8! - 4 \cdot 7! + 6 \cdot 6! - 4 \cdot 5! + 4! = 24024$.


6.8 Take $P_i$: 2i is in position 2i ($1 \le i \le 4$). Get 24024 again.
6.9 |S| = 8 × 7 × 6 = 336. Get 336 − 3(7 × 6) + 3(6) − 1 = 227.


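6.10 Let S = set of all permutations; then $|S| = \frac{10!}{2^5}$. Take $P_i$: the two i's are
adjacent. Then $N(1) = \frac{9!}{2^4}$, $N(1, 2) = \frac{8!}{2^3}$, etc. Get
$\frac{10!}{2^5} - 5 \cdot \frac{9!}{2^4} + 10 \cdot \frac{8!}{2^3} - 10 \cdot \frac{7!}{2^2} + 5 \cdot \frac{6!}{2} - 5! = 39480$.

6.11 $\phi(n) = n - \sum_i \frac{n}{p_i} + \sum_{i<j} \frac{n}{p_i p_j} - \cdots = n \prod_{p \mid n} (1 - \frac{1}{p})$.
$\phi(100) = 40$, $\phi(200) = 80$.

6.12 $f_\lambda(G) = \lambda^n - \sum N(i) + \sum N(i, j) - \cdots$. $N(i) = \lambda^{n-1}$ (both ends of edge $e_i$
get same colour). Also $N(i, j) = \lambda^{n-2}$. (There are two cases: $e_i, e_j$ may or
may not have a common vertex - get $\lambda^{n-2}$ in both cases.) Also, $N(i, j, k) =
\lambda^{n-2}$ if $e_i, e_j, e_k$ form a 3-cycle, and is $\lambda^{n-3}$ otherwise.


6.13 $S = \{(x_1, \ldots, x_{12}) : 1 \le x_i \le 6\}$. $P_i$: no $x_j$ has the value i. Number of
throws in which all numbers appear $= |S| - \sum N(i) + \sum N(i, j) - \cdots =
6^{12} - \binom{6}{1} 5^{12} + \binom{6}{2} 4^{12} - \binom{6}{3} 3^{12} + \binom{6}{4} 2^{12} - \binom{6}{5} = 953\,029\,440$. To get probability,
divide by $6^{12}$.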
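The alternating sum in 6.13 is tedious by hand, so here is a small sketch that evaluates it by inclusion-exclusion. The function name `all_values_appear` is illustrative, and the `faces` and `throws` parameters generalise the 6 faces and 12 throws of the exercise.

```python
from math import comb

def all_values_appear(throws: int, faces: int) -> int:
    """Number of sequences of `throws` values from 1..faces in which every
    value appears, by inclusion-exclusion over the set of missing values."""
    return sum(
        (-1) ** j * comb(faces, j) * (faces - j) ** throws
        for j in range(faces + 1)
    )

count = all_values_appear(12, 6)
print(count, count / 6**12)   # the count, and the corresponding probability
```

The term with j = faces contributes 0 whenever throws > 0, which is why the sum in the solution stops at $\binom{6}{5}$.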

6.14 If S = set of all partitions into 4 parts, |S| = S(10, 4). Let $P_i$: {i} is a
singleton set. So N(i) = S(9, 3), N(i, j) = S(8, 2), etc. Answer is $S(10, 4) -
\binom{10}{1} S(9, 3) + \binom{10}{2} S(8, 2) - \binom{10}{3} S(7, 1) = 9450$. Alternatively, want number
of partitions of type $2^3 4^1$ or $2^2 3^2$, i.e. $\frac{10!}{(2!)^3\, 4!\, 3!} + \frac{10!}{(2!)^2 (3!)^2\, 2!\, 2!}$.

Chapter 7


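7.1 $a_{ij} = a_{iJ} \Rightarrow 2i + j \equiv 2i + J \Rightarrow j \equiv J \Rightarrow j = J$.

$a_{ij} = a_{Ij} \Rightarrow 2i + j \equiv 2I + j \Rightarrow 2i \equiv 2I \pmod n \Rightarrow i = I$ since n is
odd. So A is a latin square. Similarly for B.

Orthogonality: $a_{ij} = a_{IJ}$ and $b_{ij} = b_{IJ} \Rightarrow 2i + j \equiv 2I + J$ and $3i + j \equiv
3I + J \Rightarrow i \equiv I \Rightarrow j \equiv J$.

Broken diagonal of A starting at $a_{0j}$ consists of j, j + 1 + 2, j + 2 + 4, ...,
i.e. j, j + 3, j + 6, .... But $j + 3u \equiv j + 3v \Rightarrow 3u \equiv 3v \Rightarrow u \equiv v$ since $3 \nmid n$.
So all entries in diagonal are different.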


7.7 (b) Msg is. 
(c) Suppose $a_{ij} = a_{IJ}$ and $a_{ji} = a_{JI}$. Then 2i + j = 2I + J and 2j + i =
2J + I. Subtracting gives i − j = I − J whence J = I − i + j. Thus
2i + j = 2I + I − i + j, i.e. 3i = 3I, so i = I, j = J.

7.8 (a) 1 appears an even number of times off the diagonal in symmetrically
placed positions. So it appears an odd number of times altogether, so
n is odd.

(b) $a_{ij} = a_{iJ} \Rightarrow j(m+1) \equiv J(m+1) \Rightarrow 2(m+1)j \equiv 2(m+1)J \Rightarrow j \equiv
J \pmod{2m+1}$. Similarly for columns. Since $2(m+1) \equiv 1 \pmod n$,
ith diagonal entry is $2(m+1)i \equiv i$. For m = 2, A is

1 4 2 5 3
4 2 5 3 1
2 5 3 1 4
5 3 1 4 2
3 1 4 2 5.


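7.9 Orthogonality. Suppose $\lambda_i \lambda_k + \lambda_j = \lambda_I \lambda_k + \lambda_J$ and $\lambda_i \lambda_h + \lambda_j = \lambda_I \lambda_h + \lambda_J$.
Subtracting gives $\lambda_i(\lambda_k - \lambda_h) = \lambda_I(\lambda_k - \lambda_h)$, i.e. $\lambda_i = \lambda_I$ since $\lambda_h \ne \lambda_k$;
so i = I. Thus $\lambda_j = \lambda_J$, i.e. j = J.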


7.10 The rows are the columns of (7.3) in a different order.
7.12 For example 3, 4, 2, 1.
7.13 {2, 5, 6}, {2, 6}, {2, 5}, {5, 6} have only 3 elements in their union.
7.14 Ignore suits; there are 4 cards of each value. Any k columns contain 4k
cards which must have at least $\frac{4k}{4} = k$ different values among them. Apply
Hall's theorem.

7.15 Any k $A_i$ contain kn elements in their union. These kn elements must be
distributed among at least $\frac{kn}{n} = k$ sets $B_j$, so the union of any k $S_i$ contains
at least k elements.

7.16 Take the 5 x 5 array N with 1,...,25 in natural order. The 5 rows 
and 5 columns give 10 blocks. Corresponding to Aj, get {5,9, 13,17, 21}, 
{1, 10,14, 18,22}, {2,6, 15, 19, 23}, {3, 7, 11, 20, 24}, {4, 8, 12, 16, 25}. 
Similarly for A2,... , As. 

7.17 Take $A_i = \{(i, j) : m_{ij} = 1\}$ and $B_i = \{(j, i) : m_{ji} = 1\}$ where $M = (m_{ij})$.
Apply Exercise 7.15 to obtain a permutation matrix $P_1$. (Alternatively
apply Theorem 7.6 or Theorem 5.13.) Then repeat argument applied to
$M - P_1$, to get another permutation matrix $P_2$. Then consider $M - P_1 - P_2$,
etc.

Chapter 8 

8.1 First round has games ∞ v 1, 9 v 2, 8 v 3, 7 v 4, 6 v 5. For 9 teams omit games
involving ∞.

8.2

x1 v y2, x2 v y3, x3 v y4, x4 v y5, x5 v y1
x1 v y3, x2 v y4, x3 v y5, x4 v y1, x5 v y2
x1 v y4, x2 v y5, x3 v y1, x4 v y2, x5 v y3
x1 v y5, x2 v y1, x3 v y2, x4 v y3, x5 v y4
x1 v y1, x2 v x5, x3 v x4, y2 v y5, y3 v y4
x2 v y2, x3 v x1, x4 v x5, y3 v y1, y4 v y5
x3 v y3, x4 v x2, x5 v x1, y4 v y2, y5 v y1
x4 v y4, x5 v x3, x1 v x2, y5 v y3, y1 v y2
x5 v y5, x1 v x4, x2 v x3, y1 v y4, y2 v y3


8.3 Yes, by Theorem 7.5. 


8.4 


(a) The spokes, (b) none, (c) edges 12, 34, 56 in Figure 4.1(a), (d) “vertical
edges”.


8.5 In Figure 4.1(a) take 1236451 and 1652431. Each gives two 1-factors. 
8.6 All. 


8.7 Hamiltonian cycle gives two 1-factors; remaining edges form another. 


8.8 (a) $A_i$ v $B_{i+k-1}$ in round k, where suffixes are reduced (mod 5) to lie in
{1, ..., 5}.

(b) Use the join of the first two MOLS of Example 7.3 to get first round
Az v Bs on court 1, A3 v By on court 2, Ag v Bs on court 3, As v By on 
court 4, A, v Bz on court 5, etc. 


8.9 Follow the method of Section 8.3, using the first 3 MOLS Aj, A2, A3 of 
Example 7.3. Then use Aq to determine the courts. Get following schedule 
where rows give rounds and columns correspond to courts. 


BsG, v big BiG, Vv bs 95 BoGe Vv bsga B3Gs5 v bogs BiG; v baga 
BiG, Vv bs 93 BzG2 Vv bego B,Gs a‘g bagi ByG3 Vv bigs B3G, Vv b3ga 


B3Go Vv bags BG; v bi 94 BsGs Vv bs gs B, G; Vv bsgo BoG4 Vv begi 
BoGs v b392 BsG3vbsgq. BaGivbogs BsGavbsgs BiG2 v bigs 
B,G3v bogs BoGivbsags B3G4vbig2 BsGovbsg: BsGs v bsgs 


8.10 (a) Only sequences with no breaks are HAHA...H and AHAH...A. 
But no two teams can have the same venue sequence (otherwise they 
couldn’t play each other). 


(b) ∞ and 0 have no breaks. Each other has a break next to its game
against ∞.


8.11 Draw a bipartite graph with n black vertices labelled by the first round 
pairs and n white vertices labelled by the kth round pairs. Join a black 
to a white when the labelling pairs are not disjoint. By Theorem 7.6 with 
m = 2 there is a perfect matching, in which each edge represents a team. 
Play these teams at home in kth round. 


Alternatively, apply Exercise 7.15 to the pairs of the first and kth rounds:
choose a common SDR and play these teams at home in round k.


Chapter 9 


9.1 (a) b not an integer. (b) b = 14 < v.

9.2 Complement of a (13, 4, 1) design.

9.3 (a) $vr(k-1)\lambda - r(k-1)\lambda = \lambda(v-1)r(k-1) = r^2(k-1)^2$.
(b) $(k-1)\lambda + (v-k)\lambda = \lambda(v-1) = r(k-1)$.

9.4 Use $v\lambda = k^2 - k + \lambda$.

9.5 $(k-\lambda)^{v-1}$ must be a square, so, dividing by $(k-\lambda)^{v-2}$, which is a square,
$k - \lambda$ must be a square. For final parts must check that the two designs
would be symmetric.

9.6 (a) $(n^2 + n,\ n^2,\ n^2 - 1,\ n^2 - n,\ n^2 - n - 1)$,

(b) $(4m - 1,\ 4m - 1,\ 2m,\ 2m,\ m)$.

9.7 Use H, and Theorem 9.15.

9.8 Use Theorems 9.18 and 9.16.

9.9 In each case the differences are $\pm(d_i - d_j)$.

9.10 - 9.14 Check differences.

9.15 Given i, there are 10 associates in same row or column, and 5 associates
given by L. So each block has 15 elements. Given i and j in the same row
of N, they are in 4 $B_k$ where k is in the same row as i and j, and in $B_h$
and $B_\ell$, where h ($\ell$) is the element of N in the same column as i (j) such
that h ($\ell$) and j (i) have equal corresponding entries in L. Similarly for
other cases.

9.16 (a) B = 2A − J has 1 where A has 1 and −1 where A has 0.
(b) $B^T B = (2A^T - J)(2A - J) = 4A^T A - 2A^T J - 2JA + J^2$
$= 4(k - \lambda)I + 4\lambda J - 2kJ - 2kJ + vJ$
$= 4(k - \lambda)I + (4\lambda - 4k + v)J = vI \Leftrightarrow 4(k - \lambda) = v$.

(c) v = 36, k = 15, $\lambda = 6$ satisfy $4(k - \lambda) = v$.

9.17 Orders 4, 8, 16, 32 by doubling; 12, 20, 24, 44, 48 by Theorem 9.18; 36 by
Exercise 9.16; 40 by doubling 20; 28 by extending Theorem 9.18 to prime
powers, using the finite field of 27 elements.

9.18 (9.9) becomes $S \ge \binom{N}{2} d$; (9.10) unchanged.

9.19 If d(x, y) = e then x + y has weight e; so min. weight $\le$ min. distance.
Conversely, if w(x) = w then d(x, 0) = w; so min. distance $\le$ min. weight.

9.20 Can assume 0 is in code. All sequences of weight 1 get corrected to 0.
There are no codewords of weight 2 (by perfectness) so all sequences of
weight 2 must get corrected to a codeword of weight 3. Use correspondence
between subsets A of {1, ..., n} and binary sequences $x_A = x_1 \ldots x_n$, where
$x_i = 1 \Leftrightarrow i \in A$. The 3-element subsets corresponding to codewords of
weight 3 form an STS(n): if |B| = 2, $B \subseteq \{1, \ldots, n\}$, B is in C where $x_C$
corrects $x_B$.

9.21 (a) Count in two different ways the pairs (A, P) where P is a pair of
elements of B and A is a block other than B containing P.

(b) Count in two different ways the number of pairs (A, y) where y is an
element of B and A is a block other than B containing y.

(d) Expanding $\sum_i (i - m)^2 x_i \ge 0$ gives $\sum i^2 x_i - 2m \sum i x_i + m^2 \sum x_i \ge 0$.
Write $i^2$ as $2\binom{i}{2} + i$; then (a) and (b) give $k(k-1)(\lambda-1) + m^2(b-1) \ge
(2m - 1)k(r - 1)$. Replacing m(b − 1) by k(r − 1) gives (d).

Next, in (d) replace (b − 1)k by vr − k to get $(vr - k)(k-1)\lambda + vr(r -
k) + k^2 - rk \ge k^2(r-1)^2$. Now use Exercise 9.3(a) and take r − k out
as a common factor to get $(r - k)(r - 2rk + (k-1)\lambda + vr) \ge 0$. Finally
use Exercise 9.3(b) to deduce that $(r - k)(v - k)(r - \lambda) \ge 0$, whence
$r \ge k$.

Further Reading


The main ideas of the first two chapters are covered by most of the standard 
textbooks on combinatorics, e.g. Brualdi [5], Cameron [6] and the remarkably 
extensive book by Graham, Knuth and Patashnik [11]. Chapters 3 to 5 cover
some of the basic ideas in graph theory; further details can be found in the
many textbooks now available, of which we particularly note those of Diestel
[9], Wilson [20], Wilson and Watkins [21], and the English translation [15] of
the ever-fresh classic of König [14]. The reader who is interested in the history
of graph theory is directed to the recent reissue of the book by Biggs, Lloyd 
and Wilson [4]. 

The inclusion-exclusion principle has excellent expositions in [5] and in van 
Lint and Wilson [18]. The proof of Cayley’s theorem on labelled trees, using 
inclusion-exclusion, is due to J.W. Moon [19], and there is a very nice article 
by J. Dutka on the ménage problem in [10]. 

Latin squares are covered in great detail by Dénes and Keedwell [7], [8], and 
are the central theme of a recent book by Laywine and Mullen [16]. There is 
also material on latin squares in [1], [5], [6] and [18]. Connections between latin 
squares and tournaments are described in [1], where the topics of Chapters 7-9 
are dealt with more fully. [1] also deals with block designs, as do [16] and [18], 
and there are good accounts of coding theory in the books by Baylis [3], Hill 
[12] and Van Lint [17]. 

Another important source of information is the web. Search for Fibonacci
numbers or derangements or latin squares, and you will be led to many
fascinating and informative sites. Also freely available on the web is a short course
on designs [2].